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HEPATITIS-C VIRUS TESTING 

I * Technical Field 

% 

The present invention relates to the discovery of new 
types of hepatitis C virus, that we have termed type 3 
{HCV-3) and type 4 (HCV-4) . In particular, it relates to 
the etiologic agent of hepatitis C virus type 3 and 4 , and 
to polynucleotides and immunoreactive polypeptides which 
are useful in inmiunoassays for the detection of HCV-3 and 
HCV-4 in biological samples; and also to the use of 
antigenic HCV-3 and HCV-4 specific polypeptides in 
vaccines . 

1 

Backcrround of the Invention 



Acute viral hepatitis is a disease which may result 
in chronic liver damage. It is clinically diagnosed by a 
well-defined set of patient symptoms, including jaundice, 
hepatic tenderness, and an increase in the sertim levels of 
alanine aminotransferase and aspartate aminotransferase. 
Serologic immunoassays are generally performed to diagnose 
the specific type of viral causative agent. 
Historically, patients presenting with symptoms of 
f hepatitis and not otherwise infected by hepatitis A, 
hepatitis B, Epstein-Barr or cytomegalovirus were 
clinically diagnosed as having non-A, non-B hepatitis 
(NANBH) by default. 
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For many years, the agent of non-A, non-B hepatitis 
remained elusive. It has now been established that many 
cases of NANBH are caused by a distinct virus termed 
hepatitis C virus (HCV) . European Patent Application 
EP-A-0318216 discloses cDNA sequences derived from HCV, 
polynucleotide probes and polypeptides for use in 
immunoassays. Further information is provided in 
European ^plication EP-A-0388232. 

The HC7 genome encodes a large polyprotein precursor, 

which contains structural and non-structural regions. The 

single protein is apparently cleaved into a variety of 

proteins after production. Most of the structural and 

non-structural proteins have now been identified from in 

vitro UNA translation and expression as recombinant 

proteins. The C and E regions encode for nucleocapsid 

structural proteins and for envelope structural proteins, 

respectively. At least five additional regions follow, 

?rtiich encode for non-structural (NS) protein of undefined 

function. The organisation is believed to be as follows 

(A. Alberti, Journal of Hepatology, 1991; 12; 279 to 282) 

5' 3' 
NCR: C : El : E2 : NSl : NS2 : NS3 : NS4 : NS5 

Certain immunoreactive proteins have been described as 

recombinant proteins, for example C22 (in the core 

region) , C33 (in NS3 region) , 5-1-1 and ClOO (both in the 

NS4 region) , and NS5 (NSS region) . Diagnosis of 

hepatitis C is still largely based on methods v^ich detect 

antibodies against the product of the C-100 clone. This 
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clone was llgated with overlapping clones to produce a 
larger viral antigen (CIOO) c rresponding to part of the 
NS3-NS4 genomic region. CIOO was then fused with the 
human superoxide dismutase (SOD) gene, expressed in use as 
a large recombinant fusion protein (Cioo-3) and used on 
solid phase to develop radio-labelled (RIA) and 
enzyme-linJced immunosorbent assays (ELISA) . 

Polynucleotides useful for screening for HCV are 
disclosed in European Patent Specification EP-A-0398748. 
European Patent Specification EP-A-0414475 purports to 
disclose the propagation of HCV in culture cells and the 
production of antigens for use in diagnostics. European 
Patent Specification EP-A-0445423 discloses an improved 
immunoassay for detecting HCV antibodies. 

Blood banJcs in the United Kingdom have recently begiin 
routine testing of blood donors for antibodies to 
components of HCV. One assay involves the detection of 
HCV antibodies to ClOO-3 polypeptides. The ClOO-3 
antibody recognises a composite polyprotein antigen within 
non-structural regions of the virus and is a consistent 
marker of HCV infection. However, in acute infections 
this antibody is unreliable because of the delay 
(typically 22 weeks) in seroconversion after exposiire. 
Furthermore, the ClOO-3 antibody test lacks specificity 
for the hepatitis C virus. 

Second generation antibody tests employ recombinant 
antigens or synthetic linear peptides representing 
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structural antigens from the highly conserved core region 
of the virus as well as non-structural antigens. However, 
it is found that some second-generation ELISA tests can 
yield false-positive reactions. The recombinant 
imnrunoblot assay CRIBA-2) incorporating four antigens from 
the HCV genome, provides a method for identifying 
genuine-anti-HCV reactivity. However, the result can be 
"indeterminate". The present workers have reported (The 
Lancet, 338; Oct. 19, 1991) varying reactivity of 
HCV-positive blood donors to 5-1-1, ClOO, 03 3C and C22 
antigens, and compared these with the results of the 
direct detection of HCV RNA present in the blood san^les 
using polymerase chain reaction (PGR) to amplify HCV 
polynucleotides. However, the work demonstrates that the 
unambiguous diagnosis of HCV infections is not yet 
possible. 

Recently there has been discovered a second type of 
HCV (References 1, 2) called K2 that differs considerably 
in sequence from the published prototype (Reference 3) or 
the first type Kl sequences (References 4 and 5) . 

Summary of the Invention 

The present invention is based on the discovery of a 
previously unknown type 3 and 4 variants of HCV, by a 
comparison of sequences amplified by PCR in certain 
regions of the HCV genome and confirmed by phylogenetic 
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analysis. The invention has thus identified 
polynucleotide sequences and polypeptides which are HCV-3 
and HCV-4 specific. These may be used to diagnose HCV-3 
and HCV-4 infection and should thus be included in any 
definitive test for HCV infection. 

One aspect of the invention provides polynucleotide 
sequences unique to hepatitis C virus types 3 and 4 (HCV-3 
and HCV-4) . The sequences may be RNA or DNA sequences. 
In principal any HCV-3 or HCV-4 specific polynucleotide 
sequence from non-coding, core. El, E2 or NSl-5 genome 
regions can be used as a hybridisation probe. The 
sequences may be recombinant (i.e. expressed in 
transformed cells) or synthetic and may be comprised 
within longer sequences if necessary. Equally, deletions, 
insertions or substitutions may also be tolerated if the 
polynucleotide may still function as a specific probe. 
Polynucleotide sequences such as core, NS3, NS4 and NS5 
which code for antigenic proteins are particularly useful. 

Another aspect provides an antigenic HCV-3 or HCV-4 
specific polypeptide, particularly from the core, NS3, NS4 
or NS5 regions (e.g, the HCV-3 or HCV-4 counterparts of 
ClOO polypeptide, 5-1-1 polypeptide, C33 polypeptide or 
C22 polypeptide or epitopes thereof) or polypeptides 
including these antigens, 

A further aspect of the invention provides labelled 
antigenic HCV-3 or HCV-4 specific polypeptide (or mixtures 
thereof, particularly from the core and NS4 regions) for 
use in an immunoassay. 
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A further aspect of the invention provides antibodies 
to HCV-3 or HCV-4 specific antigens, particularly 
monoclonal antibodies for use in therapy and diagnosis - 
Thus labelled antibodies may be used for in vivo 
diagnosis. Antibodies carrying cytotoxic agents may be 
used to attack HCV-3 or HCT-4 infected cells, 

A further aspect of the invention provides a vaccine 
co3i?)rising immunogenic HCT-a or HCV-4 specific polypeptide. 

The HCT-3 or HCV-4 specific polynucleotide sequences 
may be used for identification of the HCV virus itself 
(usually amplified by PGR) by hybridisation techniques. 

Oligonucleotides corresponding to variable regions in 
the NS-4 region could be used for type-specific PGR. 
outer sense and inner sense primers may be used in 
combination with the two conserved anti-sense primers for 
a specific detection method for HCV types 1, 2, 3 and 4. 

Immunoreactive HCV-3 or HCV-4 specific polypeptides 
(particularly from the core and KS4 regions) may be used 
to detect HCV-3 and HCV-4 antibodies in biological 
samples, and may also provide the basis for immunogens for 
inclusion in vaccines (especially the El polypeptide) . 
The term "peptide" is used herein to include epitopic 
peptides having the minimum number of amino acid residues 
for antigenicity, through oligopeptides, up to proteins. 
The peptide may be a recombinant peptide expressed from a 
transformed cell, or could be a synthetic peptide produced 
by chemical synthesis. 
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In particular, the invention allows blood donor 
screening by conventional assays (using HCV type X encoded 
antigens) to be supplemented with a second test that 
contains two oligopeptides corresponding to first and 
second antigenic regions found in the NS-4 sequence of HCV 
type 3 (positions 1691 to 1708; sequence 
KPALVFDKEVLYQQYDEH and positions 1710 to 1728; sequence 
ECSQAAPYIEQAQVIAHQF) and two derived from the equivalent 
regions of HCV type 2, R(A/V) V(V/I) (A/T) PDKE(r/V) LYEAFDEM 
and ECAS(K/R)AALIEEGQR(M/I)AEML. 

The corresponding HCV-4 antigens from substantially 
positions 1691 to 1708 and 1710 to 1728 may be used for 
HCV-4 detection. 

Thus, the present invention has also identified 
corresponding polynucleotide and polypeptide sequences 
which may be used to identify hepatitis C type 2 viral 
infection. 

Production and detection of the antigen-antibody 
immune complex may be carried out by any methods currently 
known in the art. For example, a labelling system such 
as enzyme, radioisotope, fluorescent, luminescent or 
ch em i luminescent labels may be employed, xisually attached 
to the antigen. Labelled anti-antibody systems may also 
be used. The recombinant antigen may be either used in 
liquid phase or absorbed onto a solid substrate. 

Oligopeptides corresponding to the antigenic regions 
of all three major types may also be used separately to 
serologically distinguish individuals infected with 
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diff Brent HCV types. Such an assay could be in the format 
of an indirect enzyme immunoassay (EIA) that used sets of 
wells or beads coated with oligopeptides of the two major 
antigenic regions for HCV types 4, 3 and 2, and with type 
1 (KPA(V/I) IPDREVLYEEFDEH and RPAV(I/V) PDREVLYQEFDEM and 
ECSQHIP2IEG(M/A)AEQF). Hinor degrees of cross-reactivity, 
should they exist, can be absorbed out by dilution of the 
test serum in a diluent that contained blocking amounts of 
soluble heterologous-type oligopeptides, to ensure that 
only antibody with type-specific antibody reactivity bound 
to the solid phase. 

Draminogens for use in vaccine formulations may be 
formulated according to techniques currently Jcnown in the 
art, including the use of suitable adjuvant and 
immune-stimulation systems. 

Finally, the present invention also encompasses assay 
3cits including polypeptides which contain at least one 
epitope of HCV-3 or HCV-4 antigen (or antibodies thereto) , 
as well as necessary preparative reagents, washing 
reagents, detection reagents and signal producing reagents. 

Description of the Drawincys 

Examples of the invention will now be described by 
way of example only. 

Figures 1 and la give cDNA sequences obtained from 
PGR amplification of a region -255 to -62 of the 5' 
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non-coding region of HCV samples from 18 blood donors and 
a comparison with previously published nucleotide 
sequences (see Table 2) ; sequence numbering corresponding 
to the prototype HCV-1 sequence (ref.4) and previous 
designations of type l or 2 being indicated; 

Figure 2 is a phylogenetic analysis showing 
clustering of the sequences into three types viz; HCV-1, 
HCV-2 and HCV-3 for the 5' KCR results of Figure 1 using 
the maximum likelihood algorithm, shown as an unrooted 
tree. Numbers 1-18 in full circles correspond to blood 
donor sequences E-bl through E-bl8. Numbers 1 to 26 in 
open circles correspond to the previously published 
sequences identified in Table 2 . ; 

Figure 3 is a coiaparison of deduced amino acid 
sequences in the NS-5 region of blood donors (E-bl, £-b2, 
E-b3, E-b7 (type 3) and E-bl2 (type 2) with those 
previously published (Table 2) . Amino acid residue 
numbering follows that of the HCV-l polyprotein (4) and 
uses single letter amino acid codes; 

Figure 4 is a phylogenetic analysis of the NS-5 
region using the maximum likelihood algorithm, shown as ah 
unrooted tree. Symbols are as described for Figure 2; 

Figxire 5 is a comparison of deduced amino acid 
sequences in the NS-3 region of blood donors (E-bl, E-b2, 
E-b6, E-b7 (type 3) with those previously published 
(Table 2). Group 1/1: amino acid sequence of fl, f 3 , f4, 
f5, h2, h3, h4 (one), i2, i3, i4, pi, p2 ; Group 1/2: 
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amino acid sequence of i5; Group 1/3: amino acid sequence 
of h2, h3, h4 (one), h5, f2, p3, il; Group 1/4: amino 
acid sequence of hi (one); Group 1/5: amino acid sequence 
of hi (one) . Numbering, symbols and abbreviations are as 
described for Figure 3? 

Figure 6 is a phylogenetic analysis of the NS-3 
region using the maximum likelihood algorithm, shown as an 
unrooted tree. Representative nucleotide secjuences of the 
5 groups of type 1 seijuences shown in Figure 5 coded as 
follows: 19 (full circle) i3; 20 (full circle) i4; 21 
(full circle) h5; 22 (full circle) h3; 23 (full circle) 
hi. Symbols are as described for Figure 2; 

Figure 7 is a con^arison of deduced amino acid 
sequences in the core region of blood donor E-bl (type 3) 
with those previously piiblished (Table 2) . Numbering, 
symbols and abbreviations eire as described for Figure 3; 

Figure 8 is a phylogenetic analysis of the core 
region using the ma-iriTmim likelihood algorithm, shown as an 
unrooted tree. Symbols are as described for Figure 2. 

Figure 9(a) shows nucleotide, and Figure 9(b) shows 
deduced amino acid sequences of HCV type 3 varismts 
amplified from 5 Scottish blood donors (nos. 40, 38, 36, 
26 and 1787) in the putative NS-4 region of HCV 
(nucleotides and amino acid residues numbered as in Choo 
et al. , (1991) - Nucleotide codes: G: guanidine; C: 
cytidine; A: adenine; U: uridine; amino acid codes: A: 
alanine; R: arginine; N: asparagine; D: aspartic acid; C: 



wo 93/10239 



PCT/CB92/02143 



- 11 - 

cysteine; Q: glutanine; £: glutamic acid; G: glycine; H: 
histidine; I: isoleucine; L: leucine; K: lysine; M: 
methionine; F: phenylalanine; P: proline; St serine; T: 
threonine; W: trytophan; Y: tyrosine; V: valine.".": 
sequence not determined; difference from consensus shown 
in bold. 

Figure 10(a) shows a comparison of amino acid 
sequences between residues 1679 and 1768 (Choo et al., 
1991) of the three major variants of HCV. T16, T42, T77, 
T1801, T1825: Scottish blood donors infected with HCV 
type 1; T351: Scottish blood donor infected with HCV type 
2; T59, T940, T810: Scottish blood donors infected with 
HCV type 2; T40, T38, T36, T26, T1787: Scottish blood 
donors infected with HCV type 3; and Figure 10(b) shows 
the derivation of consensus sequences for HCV types 3, 2 
and 1 oligopeptide series. Differences from consensus 
shown in bold. Amino acid codes: A: alanine; R: 
arginine; N: asparagine; D: aspartic acid; C: cysteine; 
Q: glutamine; E: gluteunic acid; G: glycine; H: histidine; 
I: isoleucine; L: leucine; K: lysine; M: methionine; F: 
phenylalanine; P: proline; S: serine; T: threonine; W: 
tryptophan; Y: tyrosine; V: valine; ",": not determined. 

Figtires 11(a) to 11(c) show amino acid sequences of 
noneuneric oligopeptides used for epitope mapping, derived 
from consensus HCV type 3, type 2 and type 1 sequences 
respectively. Amino acid codes: A: alanine; R: arginine; 
N: asparagine; D: aspartic acid; C: cysteine; Q: 
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glutamine; E: glutamic acid; G: glycine; H: histidine; I: 
isoleucine; L: leucine; K: lysine? M: methionine; F: 
phenylalanine; P: proline; S: serine; T: threonine; W: 
tryptophan; Y: tyrosine; V: valine; 

Figures 12a, 12b and 12c show antibody reactivity of 
three sera from blood donors infected with HCV type 3 with 
HCT type 3 -encoded oligopeptides in the antigenic region 
of NS-4 (sequences 1-82 shown in Figure 11a) . Antibody 
reactivity to oligopeptides (x-axis) , recorded as optical 
densities in the range from -01 to 0.75 (and >0,75) 
recorded on the y-axis- 

Figure 13 is a comparison of divergent HCV sequences 
with representative type 1, 2 and 3 sequences in variable 
regions of the 5'HCR. Sequences from -255 to -246, -215 to 
-186, -115 to -102 and -69 to -62 identical to prototype 
sequence. sequence identity with HCV-l; gap 

introduced in sequences to preserve alignment; 
sequence not determined. Origins of sequences : Eg- 1-33 
: Egypt; NL-26 : Holland; HK- 1-4 : Hong Kong; IQ-48: 
Iraq; XX-96 : xxxxx. Figures in parentheses n umber each 
non- identical sequence. 

Figure 14 is a phylogenetic analysis of the 5 'NCR 
region using the mayiTimm li3celihood algorithm, shown as an 
unrooted tree. Sequences 1-17 in solid circles are 
numbered as in Figure 13; previously published sequences 
numbered as in table 1 of (992), Scottish blood donor 
sequences Eb-l-Eb-12 numbered 51-62 in hollow circles- 
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For clarity, only non-identical sequences are shown in 
tree; e.g. Secfuence 1 corresponds to those found in 
samples Eg-16 and E9-29 etc. (Figure 1) . Hollow squares 
are published sequences from Zaire; Hollow small circles 
are sequences from South Africa; Hollow small solid 
circles are sequences obtained elsewhere in the world. 

Figure 15 A/B is a comparison of nucleotide (A) and 
amino acid (B) sequences in the core region. Symbols as 
for Figure 13. single letter amino acid codes are used; 

Figure 16 is a phylogenetic analysis of part of the 
core region using the maximum likelihood algorithm, shown 
as an unrooted tree. Sequences are numbered as in Figure 
14; sequence 30 is that of HC-J8 (Okamato et al. Virology 
188:331 -341) ; and 

Figure 17 shows cleavage patterns for A) Haelll/Rsal 
and B> ScrFI in 5 'NCR. 

I) ANALYSIS OF HEPATITIS C VIRUS AND 
PHYLOGENETIC RELATIONSHIP OF TYPES 1, 2 and 3 
Introduction 

Sequence analysis of the 5' non coding region of 
hapatitis C virus (HCV) amplified from the plasma of 
individuals infected in Britain revealed the existence of 
three distinct groups of HCV, differing by 9-14% in 
nucleotide sequence. Two of the groups identified were 
similar to those of HCV variants previously termed type 1 
and type 2, while the third group appeared to represent a 
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novel virxis type. Sequence comparisons were then made 
between the three virus types in other regions of the 
viral genome. In the NS-5 region, a high degree of 
nucleotide and amino acid sequence diversity was observed, 
with samples classified here as type "3" again forming a 
distinct group that was phylogenetically distinct from 
type 1 and type 2 variants. Type 3 sequences were 
similarly differentiated in the NS-3 and core regions from 
HCT type 1 sequences. The designation of virus types, 
including an observed sub-division of type 1 sequences 
into geographically distinct variants is discussed in 
relation to the new sequence data obtained in this study. 

Discussion 

Replication of nucleotide sequences by polymerase 
chain reaction (PGR) is a recently established technique, 
synthetic complementary primer sequences are hybridised to 
single-stranded DMA on either side of a genome region to 
be copied- The second strand is built up under the action 
of a heat-stable polymerase in the region between the 
primers. Heating then dissociates the two-strands and the 
replication process starts again. The PGR technique 
allows tiny amounts of polynucleotide to the amplified 
provided that there is sufficient sequence information to 
synthesise the primar sequences. 

The major problem associated with the use of the PGR 
to assess sequence variation using the PGR is the 
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p ssibility that mismatches between the primers and the 
variant sequence will prevent amplification. We have used 
several strategies to overcome this problem. For initial 
virus detection, we used primers in the 5 'NCR, which are 
reported to highly conserved amongst type 1 variants (4, 
IX, 13, 16, 23, 24, 26, 33), and between Kl and K2 (23). 
Sequence analysis of the blood donors allowed the 
identification of type 1 and type 2 variants by comparison 
with published sequence data. This analysis also revealed 
the existence of a third "type" of HCV that appeared to be 
as distinct from type 1 as type 2 was (Figs. 1,2; Table 3) . 
Based on our initial tentative classification, we sought 
corroboration of our findings in other (coding) and more 
variable regions of the viral genome. 

Analysis of the NS-S region, which was based on 
several sequences of each of the three types (Figs. 3, 4; 
Table 3), confirmed the existence of 3 major groups, with 
type 3 sequences forming a relatively homogeneous group 
that was quite distinct from types 1 and 2. The proposed 
separation of type 1 sequences into PT and Kl "sub-types" 
and type 2 sequences into, K2a and K2b is supported by this 
analysis, in which the single type 2 blood donor sequence 
obtained in this study appears most similar to K2b. 
Differentiation of HCV type 1 sequences into two groups is 
also clearly shown in the core (Fig. 7) and NS-3 regions 
(Fig. 5) , in both cases with the type 3 sequences appearing 
considerably more distant. 
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THe clustering of phylogenetically distinct groups, 
their mixed distributions in a single geographic area (1, 
7, 22, 27, 35) and our own finding of dual or triple 
infections in individual haemophiliacs all strongly 
suggest that the three types described here are distinct 
viruses rather than simply representing geographical or 
epidemiologically clustered variants of a single, highly 
variable but monophyletic group. 

our own phylogenetic analysis of the S'NCK reveals 
the existence of three distinct groups. This contrasts 
with analyses of coding region, where there appears to be 
a very prominent differentiation of type 1 sequences into 
two "subtypes". However, unlike type 2 and 3 variants, 
the two subtypes are geographically distinct, one sub-type 
can?»rising sequences obtained exclusively from Japanese 
patients, and the other comprising predominantly 
USA/European sequences (Table 2), Indeed the only 
exception to this geographical classification is the HC-Jl 
sequence (26) ; one apparent exception (Pt-1) was obtained 
from a Japanese haemophiliac treated with imported factor 
vm of aSA origin (7,23), which is likely to have 
contained HCV variants corresponding to the other 
sub-type. There is insufficient sequence data to indicate 
whether the two proposed type 2 subtypes, K2a and K2b 
(7,23) represent geographically distinct variants. 

Thp genomic organisation of HCV corresponds to that 
of flaviviruses and pestiviruses , with a single open 
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reading frame encoding a polyprotein that is subsequently 
cleaved into structural and non-structural proteins. Weak 
sequence homologies have been detected with several other 
virus groups that have positive-sense RNA genomes 
(19,21) . Although the overall degree of sequence 
dissimilarity between types 1, 2 and 3 cannot be measured 
by comparison of the small regions of sequence emalysed in 
this study, a rough estimate of the extent of divergence 
in protein coding regions is given by an examination of 
the divergence of the partial core sequence. This shows 
that the difference between HCV type 1 and type 3 core 
region (approximately 10% amino acid sequence divergence) 
is comparable to that which exists between different 
serotypes of the flavivirus, tick-borne encephalitis virus 
(14%; ref.20), but lower than that which is found between 
serotypes of a mosquito borne flavivirus, dengue fever 
virus (33%) , and the West Nile (WN) subgroup (28-43% 
divergence). The 5 'NCR sequences of the different members 
of WN subgroup are also considerably more diverse than 
those of the three types of HCV (=^50% similarity; ref.5), 
although within each of the members e.g. Murray Valley 
encephalitis virus, the 5 'NCR is extremely well conserved 
(>95% similarity; ref.5). On the basis of these 
analogies, we speculate that the major types of HCV 
represent distinct "serotypes", each capable of human 
infection irrespective of the immune response mounted 
against other HCV types. 
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METHODS 



samples . Plasma from 18 different blood donors (E-bl 
through E-bl8) , that were repeatedly reactive on screening 
by Abbott 2nd generation enzymeimmunoassay (EIA) , and 
confirmed or indeterminate by a recombinant immunoblot 
assay (RIBA; Ortho; ref 1) were the principal sair^les used 
in this study. Sequences in the NS-3 region from 5 
anti-HCV positive IVDUs (abbreviated as il - i5 in ref 
31), 5 ha^ophiliacs who had received non-heat treated 
clotting concentrate, and who were also anti-HCV positive 
(hl-h5) , 3 pools of 1000 donations collected in 1983 
(pl-p3>, and 5 separate batches of commercially available 
non-heat treated factor VIII (fl - f5) correspond to those 
described previously (31) . 

Primers, The primers rased for cDNA synthesis and 
polymerase chain reaction (PGR) are listed in Table 1. 
They were synthesised by Oswel DMA Service, Department of 
Chemistry, University of Edinburgh. 

RNA Extraction and PGR. HCV virions in 0.2-1.0 ml volumes 
of plasma were pelleted from plasma by ultracentrifugation 
at 100,000g for 2 hours at 4°C. RNA was extracted from 
the pellet as previously described (2,31). First strand 
CDNA was synthesized from 3ul of RNA sample at 42^0 for 30 
min. with 7 units of avian myeloblastosis virus reverse 
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transcriptase (Promega) in 20ul buffer containing 50 mM 
Tris-HCl (pH 8.0), SmM MgCl2, 5 mM dithiothreitol, 50 mM 
KCl, 0.05 ug/ul BSA, 15% DMSO, 600 uM each of dATP, dCTP, 
dGTP and TTP, 1.5 uM primer and 10 U RNAsin (Promega). 

PGR was performed from 1 ul of the cDKA over 25 
cycles with each consisting of 25 sec. at 94°c, 35 sec. at 
50°C and 2.5 min. at 68°C. The extension time for the 
last cycle was increased to 9.5 min. The reactions were 
carried out with 0.4 unit Tag polymerase (Northumbria 
Biologicals Ltd.) in 20 ul buffer containing 10 mM 
Tris-HCl, pH 8.8, 50 mM KCl, 1.5 mM MgCl2, 0.1% Triton 
X-100, 33 UM each of dATP, dCTP, dGTP and dTTP and 0.5 uM 
of each of the outer nested primers. One ul of the 
reaction mixture was then transferred to a second tube 
containing the same medium but with the inner pair of 
nested primers, and a further 25 heat cycles were carried 
out with the same programme. The PGR products were 
electrophoresed in 3% low melting point agarose gel (IBI) 
and the fragments were detected by ethidium bromide 
staining and UV illumination. For sequence analysis, 
single molecules of cDNA were obtained at a suitaO^le 
limiting dilution at which a Poisson distribution of 
positive and negative results was obtained (30) . 

Direct Sequencing of PGR Products . The PGR products were 
purified by glass-milk extraction ("GeneClean" ; BiolOl, 
Inc.). One quarter of the purified products was used in 
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sequencing reactions with T7 DNA polymerase (Sequenase; 
tmited States Biologicals) performed according to the 
manufacturer's instructions except that the reactions were 
carried out in 10% DMSO and the ten^late DNA was heat 
denatured before primer annealing. 

Phvlogenetic Methods . The sequences were compiled by 
version 2.0 of the programs of Staden C32) and analysed by 
programs available in the University of Wisconsin Genetics 
Computer Group sequence analysis package, version 7.0 
(6) . Phylogenetic trees were inferred using two different 
programs available in the PHYLIP paOcage of Felsenstein 
(version 3,4 June 1991; ref.9). The program DNAML finds 
the tree of the highest likelihood (the maximum likelihood 
tree) given a particular stochastic model of molecular 
evolution and has been shown to perform well in simulation 
studies (28). In the analyses performed here the global 
(G) option was used as this searches a greater proportion 
of all possible trees. The second program used was 
NEIGHBOR which clusters (following the algorithm of Saitou 
& Nei; ref.29) a matrix of nucleotide distances previously 
estimated using the program DNADIST (which itself was set, 
using the D option, to use the same stochastic model as 
underlies DNAML in order to estimate distances corrected 
for the probabilities of multiple substitution) . In all 
cases the maxi"™ likelihood and neighbotir joining 
procedures produced congruent trees and thus only the 
former have been presented here. 
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To establish the interrelationships of the major 
types of HCV, we have separately analysed several regions 
of the viral genome that differ in sequence variability 
and evolutionary constraint. Thus the conclusions drawn 
from the sequence comparisons are not subject to spurious 
evolutionary phenomena that may affect a particular 
region. However, one problem with the analysis presented 
here was the absence of a viral sequence that was 
sufficiently distantly related to HCV to serve as an 
out-group. Thus, although we describe the 
interrelationships of different sequence variants of HCV, 
it should be stressed that we have no means of deciding 
v^ich sequence is ancestral to the others. The trees are 
thus drawn in the less familiar un-rooted form to indicate 
this. 

RESULTS 

1) Analysis of the 5^ non-coding region . Samples were 
obtained from 18 blood donors that were repeatedly 
reactive in the Abbott 2nd Generation enzyme immunoassay 
and which were confirmed or indeterminate in the Chiron 
4-RIBA (E-bl through E-bl8, ref.lO). HCV sequences 
present in stored plasma samples from each donor were 
amplified with primers corresponding to sites in the 5 'NCR 
(12,25) that are well conserved between all known HCV type 
1 and type 2 variants (4,11,13,16,23,24,26,33). 
Sequencing of the PGR product, after limiting dilution to 
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isolate single molecules of cDNA before amplification, 
allowed approscimately 190 bps in the centre of the region 
to be compared with equivalent published sequences (fig.l). 

Within the sequences, constant as well as variable 
regions can be found. Six sequences from donors E-bl3 
through E-bl8 closely resembled those previously described 
as type 1 (4,11,13,16,23,24,26,33) and others resembled 
type 2(23) sequences (E-b9 through E-bl2). However, eight 
sequences (E-bl through E-b8) were distinct from both 
types, and have been provisionally termed type 3. 
Division of the sequences into three types is supported by 
formal phylogenetic analysis using the maximum likelihood 
(fig. 2) and neighbour joining algorithms (data not shown) 
of the blood donor sequences along with previously 
published sequences (identified in table 2) - Sequence 
variability within the three groups is in each case 
considerably less than that which separates the types. No 
sequence intermediate between the three types were found. 
This tree shows that the provisional l y identified type 3 
group is equally distinct from type 1 as is type 2. Using 
the DNML model, the corrected distances between sequences 
within each type were in each case less than 3%. Between 
groups, they ranged from 9% (between type 1 and type 3, 
and between type 1 and 2), to 14% between type 2 and type 
3 (table 3) . 
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2) Analysis of the NS-5 Region . The nucleotide sequenc 
of the NS-5 region has been found to vary significantly 
between the previously described Kl and K2 variants of HCV 
(7) , To investigate whether type 3 sequences were equally 
distant from the other two types in this region as well as 
in the 5 'NCR, we compared sequences from four type 3 blood 
donors (E-bl, E-b2, E-b3 and E-b7) and one type 2 donor 
(E-bl2) with previously published sequences (Fig. 3; 
Fig. 4; table 3} . 

A remarJcable variation was observed between sequences 
of the three types in this region. Again, type 3 
sequences form a separate group from type 1 and type 2 in 
this region. However, unlike the 5 'NCR, there appear to 
be subdivisions within the type 1 and type 2 groups. Type 
1 sequences are split between those found in Japanese 
infected individuals (e.g. HCV- J; HCV-BK; sequence numbers 
12, 13, 16-20 in table 2) and those of U.S.A. origin 
(HCV-1, Pt-1, H77, H90; sequence numbers 1-4; fig. 4). 
There is also some evidence for a split between type 2 
sequences, those corresponding to their previotis 
designation as K2a (7) appearing distinct from type K2b 
sequences and the Scottish blood donor, E-bl2. 

Table 3 shows that the average nucleotide distances 
between the two groups of HCV type i sequences is 25% 
(indicated here as type la [USA] and type lb [Japanese]), 
with variation of only 4-7% within each group. The 
nucleotide sequence divergence within the two type 1 
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groups is similar to that which exists between K2a and K2b 
(table 3). However, both of these distances are 
considerably less than those which exist between type 1 
and type 2 sequences (52-62%) , and type 3 (48-49%) , and 
the distance between type 2 and type 3 sequences (53-60%) . 

3) Analysis of the NS-3 region . Amplification reactions 
were carried out using previously published primer 
sequences in the NS-3 region (37) , and a pair of 
empirically derived inner primers (31) - Although these 
primers amplified HCV sequences from a high proportion of 
anti-C-lOO positive sera from haemophiliacs (31) , they 
were less effective with sera from IVDUs (31) , and with 
blood donor samples (3 positive out of 15 tested r data not 
shown) . Two conserved sites in the aii?)lif led fragment 
were identified by sequence analysis of the NS-3 region 
from the haemophiliac and IVDU patients, and two new 
primers corresponding to these were specified (207, 208; 
Table 1) . The combination of 288-208 (first round) and 
290-207 (second round) primers successfully amplified 
samples from four donors infected with HCV type 3 (E-bl, 
E-b2, E-b6 and E-b7) but none of those infected with HCV 
type 2 (data not shown) . This enabled a comparison of the 
new type with our own (31) and previously published type 1 
sequences (figs. 5, 6? table 3). For clarity, only seven 
of the type 1 sequences obtained in this study (E-bl6, 
E-bl7, 13, 13, h5, h3 and hi) are shown in the tree. 
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These sequences are representative of the range of 
variation found in this region in individuals infected in 
Britain; comparison of the tree previously published (31) 
with fig. 6 shows that the former forms a very small 
component of the overall tree obtained once Japanese type 
1 emd type 3 sequences are added. 

The maximum likelihood tree shows that type 1 and 
type 3 have diverged considerably from each other. As was 
found in the NS-5 region, subtypes of type 1 sequences are 
found in NS-3. Again, sequences of Japanese origin 
(HCV-J, HCV-BK and JH) are distinct from the prototype 
(PT) sequence, and those found in Scottish blood donors 
(E-bl6, E-bl7, pl-3), rVDUs (il-5) and haemophiliacs 
(hl-5) , all of which correspond to the prototype sequence 
(fig. 5). However, the average subtype difference (23%) is 
lower than those that exist between HCV-1 and HCV-J with 
the four type 3 sequences (37-43%) . As reported 
previously (31) , the majority of nucleotide substitutions 
that exist between type 1 sequences are silent (i.e. do not 
affect the encoded amino acid sequence) , while numerous 
amino acid substitutions exist between type 1 and type 3 
sequences (fig. 5). The analysis of the NS-3 region 
includes the sequence of clone A (35) which was obtained 
from Japanese patients with NANB hepatitis, and which was 
reported to be distinct from existing HCV type 1 
sequences. In Fig. 6, this sequence appears to be 
distinct from both HCV type 1 and type 3, with corrected 
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sequence distances of 33-43% and 36% respectively. 
Although it is not possible to assign this sequence to any 
known group at this stage, these distances are not 
inconsistent with the hypothesis that it represents a type 

2 sequence, or an equally distinct novel HCV type. 

4) Partial Sequence of the Putat ive Core Region of HCV. 

The region encoding the putative core protein is 
coaiparatively well conserved in its nucleotide sequence 
between 3aiown type 1 variants, showing nucleotide and 
amino acid sequence similarities of 90-98% and 98-99% 
respectively (11, 24) . Part of the core region from the 
blood donor Ebl, vho has type 3 sequences in other regions 
analysed was an5)lified with primers 410 and 406 and 
compared with previotisly published type 1 sequences 
(fig*7, 8; table 3). This analysis confirms that the type 

3 sequence was distinct from those of type 1, and again 
there was a prominent subdivision of type 1 sequences into 
Japanese (HCT-J, HCT-BK, HC-J4, JH and J7) and 
USA/European (HCV-1, H77, H90, GSSX, cm) sequences. As 
was found in NS-3, very little amino acid sequence 
variation is found in the core regions of type 1 
sequences; almost all of the nucleotide differences 
between the two groups are at "silent" sites. By 
contrast, the type 3 sequence shows 7-8 amino acid 
substitutions on comparison with type 1 sequences. 
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TABLE I 

SEQUENCES AND SOURCES OF PRIMERS USED FOR AMPLIFICATION OF HCV GENOME. 



Name 


Region 


Posinon 


c* b 

Sense 


Sequences 5*-3' 








of S'basc"* 








209 


D NCR 


5 






(12) 


211 


5 NCK 








(12) 


939 


5' NCR 








(25) 


940 


5'NCR 


-279 




TTCACGCAGAAAGCGTCTAG 


(25) 


410 


CORE 


410 




ATGTACCCCATGAGGTCGGC 




-406 


CORE 


-21 


+ 


AGGICICGIAGACCGTCCATCATGAGCAC 






NS-3 


4951 




CCGGCATGCATCTCATGATGTAT 


(31) 




NS-3 


4932 




GTATTTGCTGACTGGGTGCGIC 


(31) 


208 


NS-3 


4662 


+ 


1 C 1' 1 OAA 1" ITlOGOAGGGCUIi: I" 1 




207 


NS-3 


4699 




CATATAGATGCCCACirCCTATC 





007 NS-* 5293 - AACICGAGTATCCCACItjATGAAGTrCCACAT 

J20 NS-4 5278 - CACATGTCCmCGCCCAGAA 



BCVtypeSfi 

221 

008 



NS-4 
NS-4 



4858 
4878 



GGACCTACGCCCCTTCTATA 
TCGGTICKiGGCCTGTCCAAAATC 



HCV type 2 : 

2S1 

509 



NS-4 
NS-4 



4858 
4878 



GGTCCCACCCCTCTCCTCTA 
CCGCrrCGGTrCCGTrACCAACG 



HCVtype i: 

253 NS-4 4858 -r GGGCCAACACCCCTGCTATA 

196 NS-4 4878 + CAGACTGGGCGCCGrTTCAGAATG 
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Table 1 cont.. 



242 


NS-5 


8304 




GCjCGGAATTCCTGGrrCATACjCCTCCGTGAA 


555 


NS-5 


8227 




CCACGACTAGATCATCTCCG 


243 


NS-5 


7904 




TGGGGATCCCGTATCATACCCGCTGCTITCA 


554 


NS-5 


7935 




CTCAACCGTCACTGAACAGGACAT 



"Position of 5*base iclanve to HCV genomic sequence in icf no. (4) 
^Otiemadon of pixmer sequence (-t-: sense; -: ami-sense) 



t Abtscviadons: A: arifminr, c: cyddme: G: gnanfdfne, T: thymidine. 

f Separate sense pnmers leqmred to enable amplification of each HCV type. 
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TABLE 2 

SOURCE AND CITATION OF PREVIOUSLY PUBUSHED HCV SEQUENCES USED IN THIS STUDY 



Geographical 



No. 


Type 


Abbreviation 


Source 


Reference 


Ret 


1 


1 


HCV-l 


U.SjV. 


Chooetai,, 1991 


(4) 


2 


1 


Pt-1 


Japan 


Nakaoera/,. 1991 


(23) 










Enomoto et al., 1990 


(7) 


3,4 


1 


H77, H90 


U.Sj\- 


Ogaia etaL, 1991 


(24) 


5,6 


I 


GM-1, GM-2 


Germany 


Fuchscfo/., 1991 


(U) 


7 


1 


Jl 


Japan 


Han«a/.. 1991 


(13) 


8 


I 


Al 


Australia 


Hanaro/.. 1991 


(13) 


9 


1 


SI 


S. Africa 


Han fro/.. 1991 


(13) 


10 


I 


Tl 


Taiwan 


Han etaL, 1991 


(13) 


11 


I 


U18A24 


U.SJ\/Itaiy 


Han etal., 1991 


(J3) 


12 


I 


HCV-J 


Japan 


Kaio et aU 1990 


(16) 


13 


1 


HCV-BK 


Japan 


Takamizawa et al., 1991 


(33) 


14,15 


I 


HC-Jl. -J4 


Japan 


Okamoco et aL, 1990 


(26) 


16-20 


1 


KU Kl- 1-4 


Japan 


Enomoto et al„ 1990 


(7) 


21 


1 


JH 


Japan 


fCubo et al„ 1990 


(17) 


22 


1 


J7 


Japan 


Takcuchi et al., 1990 


(34) 


23-26 


2 


K2a.K2a-l. 


Japan 


Nakaoffffl/., 1991 


(23) 






K2b. K2b-l 




Enomoto etal.. 1990 


(7) 


27 


0 


Cbne A 


Japan 


Tsukiyama-Kohanu 1991 


(35) 



wo 93/10239 PCr/GB92/02143 

- 30 - 
TABLE 3 



NUCLEOnDE DISTANCES BETWEEN THE THREE HCV TYPES 
IN FOUR REGIONS OF THE GENOME. 



REGION 



TYPES (n*) la 



lb 



2b 





I (20) 


0.0163 


n/a" 










(6) 


0.0869 


n/a 


0.0214 










0.0948 


n/a 


0.1331 


n/a 


0.0123 


CORE 


la loj 


0.0358 












lb (5) 


0.0855 


0.0227 










3(1) 


0.1801 


0.1511 


n/£f 


n/d 


0.0000 


NS-3 


la (34) 


0.0699 












lb (3) 


0:2270 


0.0535 










3(4) 


03689 


0.4279 


D/d 


n/d 


0.0460 


NS-5 


la (4) 


0.0743 












lb (7) 


0.2477 


0.0372 










2a (2) 


0.6092 


0.6206 


0.0612 








2b (3) 


0.5214 


0.5732 


0.2252 


0.0655 






3(4) 


0.4754 


0.4890 


0.5983 


0.5299 


0.0322 



"nQmber of sequences analysed 
Via : not applicable 
"n/d ; not done 
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-Bi- 
ll) . SEROLOGICAL REACTIVITY OF BLOOD DONORS INFECTED WITH 

THREE DIFFERENT TYPES OF HEPATITIS C VIRUS. 

HCV sequences were amplified in the 5 'non-coding 
region (5 'NCR), core, NS-3 and NS-5 regions from blood 
donors, haemophiliacs and intravenous drug abusers. 

Blood donations that were repeatedly reactive on 
screening with Abbott 2nd generation enzyme immunoassay 
(EIA) and positive or indeterminate by Ortho recombinemt 
immunoblot assay (RIBA) were amplified by primers in the 
5 'NCR (reference 10) . The first fourteen PCR-positive 
blood donations (where PCR was used to amplify and thus 
detect HCV RNA present in the blood) were then typed by 
sequence cmalysis of the amplified region, and compared 
with their serological reactivity to a range of structural 
and non-structural peptides in two 1st generation EIAs 
(Ortho HCV EUSA; Abbott HCV EIA) and two RIBA assays 
(ortho RIBA and Innogenetics LIA; Table 4) , The five 
donations containing HCV type 1 sequences were positive in 
both EIAs, reacted with all antigens in the Ortho RIBA 
assay, and were broadly reactive in the LIA. However, all 
but two of the sera from donors with type 2 and 3 
infections were completely negative on anti-ClOO EIA 
screening and failed to react with 5-1-1, ClOO (RIBA) and 
NS4 (LIA). 
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Furthermore, some carriers of HCV type 3 variants 
reacted poorly with the C33 (NS-3) peptide in the Ortho 
RIBA, and yielded two "indeterminate" results (donor nos. 
11 and 13) . 

ThuB, current tests using Ortho RIBA and (to a lesser 
extent) Innogenetics LIA tests are unable to reliably 
detect HCV-2 and HCV-3 genotypes. For reliable testing 
for all HCV types, antigens from 5-1-1, ClOO and NS4 for 
each of the three types of HCV should preferably be 
included in the panel of antigens. 
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TABLE 4 



SEROLO GICAL REACTTVITY OF SERA FROM BLOOD DONORS 
INFECTED WTTH THREE TYPES OF HEPATmS C VIRUS 



OOD07 


BCV 
genotype 


an 
CI 
0* 


tl- 
00 

At 


Ortbo RZBA 
5-1-1 ClOO C33 C22 


TnnngenaticB LZA 
HS4 HS5 Clt 02 C3 C4 


B-bl2 
B-bl5 
B-bl6 
B-bl7 
B-bl8 


1 


+ 
+ 

+ 


+ 
+ 

4- 
*• 


35 4 4 4 


2S 3 1 2 1 1 
2 3 3 2 2 1 

2 3 2 3 3 - 

3 3 3 2 1 1 
3-211- 


B-b9 
B-blO 
B-bll 
B^12 


3 


+ 


♦ 


13 4 


- - 3 113 

- 3 2 2 2 - 

- 3 4 2 2 3 

- 13 12 2 


B-bl 
B-b2 
B-b3 
B-b5 
B-b7 


3 






- - 4 4 
- 2 4 
2 4 


- 13 1-3 

- 21112 
2 2 1 2 2 1 

- - 3 12 3 
2 3 114 



*Ortho HCV EUSA (Recombinant CI 00-3) 

tAbbott HCV EL\ (Hepatitis C Recombinant DNA Antiecn) 

tCore oligopeptides, 1-4 

§Band5 scored - (negative) to 4 (strong positive) according to manufacturers instnictions. 
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PART HI. MAPPING OF AMTIGEWXC DETERMIMANTS IN NS-4 



Introduction 

With, an overall aim of improving serological 
screening assays, we have obtained sequence data from the 
antigenic region of region corresponding to clOO-3 for 
types 2 and 3. This information was used to epitope map 
the region, to define additional immunoreactive peptides 
that could be used to improve serological anti-HCT assays. 

Methods: 

PGR and sequencing - Plasma samples from Scottish blood 
donors yielding repeatedly reactive donations on 2nd 
generation anti-HCV screening (Abbott or Ortho) , and vhich 
were confirmed or indeterminate on confirmatory testing by 
RIBA (Chiron) were referred to the Department of Medical 
Microbiology from the Scottish National Blood Transfusion 
Service Microbiology Reference Laboratory. HCV RNA within 
the plasma samples was extracted and amplified with 
primers in the 5'NCR as described previously (Chan et al., 
1992) . HCV was typed by sequence analysis of the 
an^lified DNA as described previously (Simmonds et al., 
1990} and by RFLP analysis. 

Five samples from different donors infected with HC7 
type 3 (nos. 40, 38, 36, 26 and 1787), four infected with 
type 2 (nos. 351, 59, 940 and 810) and five with type 1 
infection (nos- 16, 42, 77, 1801 and 1825) were amplified 
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with primers corresponding to sense and ant i -sense 
sequences spanning the antigenic region of NS-4 (table 
1) . Nucleotide sequences obtained from the amplified DNA 
were compared and used to define consensus sequences for 
each HCV type. In-frame translation of the nucleotide 
sequences yielded an uninterrupted consensus amino acid 
sequence that was used to define a series of overlapping 
oligopeptides for epitope mapping. 

Epitope mapping and determination of antibody specificities 

Overlapping synthetic peptides were synthesised on 
polypropylene pins using kits commercially available from 
Cambridge Research Biochemicals Ltd, The principle of the 
addition reactions is described in refs (Geysen et al., 
1984; Geysen et al., 1985). Antibody reactions were 
carried out on pins disrupted by sonication (30 minutes) 
in 1% sodium dodecyl sulphate, 0.1% 2-mercaptoethanol, 
O.IM sodium dihydrogen orthophosphate . Pins were 
pre-coated in 1% ovalbumin, 1% bovine serum albumin, 0.1% 
Tween-20 in phosphate buffered saline (PBS) for one hour 
at room temperature. Serum or plasma was diluted 1.40 in 
PBS + 0-1% Tween-20 (PEST) and incubated with the blocked 
pins at 4°C for 18 hours. After washing in 4 changes of 
PBST (10 minutes at room temperature, with agitation) , 
bound antibody was detected by incubation in a 1/20000 
dilution of affinity isolated anti-human IgG, peroxidase 
conjugate (Sigma) for one hour at room temperature. 
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Following washing (4 changes in PBST) , pins were incubated 
in a 0.05% solution of 

azino-di-3-ethyl-benzthiazodinsulphonate in 0-1 M sodium 
phosphate/sodium citrate buffer (pH 4.0) containing 0.03% 
hydrogen peroxide for 20 minutes. Optical densities were 
read at 410 nm. 

RESULTS 

HCV UNA in plasma samples from five donors infected 
with HCV type 3 by sequence analysis of the and by 

RFLP were an^lified in the NS-4 region using primers 
listed in Table 1. Because of the high degree of sequence 
variability in this region r it was necessary to use 
separate sense primers for the an^xlification of different 
HCV types. However, the anti-sense primers were in a 
highly conserved region and could be used for 
aii4)lification of all three types. Sequence analysis was 
carried out as previously described. This gave a 
continuous sequence from position 4911 to 5271 (numbered 
as in Choo et al. , 1991) (Figure 5a) . Little sequence 
variability (hi^ilighted) was observed between the four 
different donors in this region. 

The nucleotide sequences were used to deduce the 
sequence of the encoded peptide (Figure 5b) - The putative 
protein contains mainly hydrophillic residues but no 
potential sites for N-linked glycosylation. Amino acid 
sequence variability with HCV type 3 was confined to only 
five residues (Figure 5b) . However, this region differed 
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considerably from the amino acid sequences of other blood 
donors infected with HCV types 1 and 2 (T16, 42, 77, 1801, 
1825, 351, 940 and 810; Figure 6a}. Sequence comparison 
between the major HCV types from residues 1679 to 1769 
reveals three regions of considereible amino acid sequence 
variability. Most of the observed differences between 
types involve non-synonymous amino acid substitutions, 
particularly alternation of acidic and basic residues in 
the hydrophillic regions. These changes would be expected 
to profoundly alter the overall conformation of the 
protein, and its antigenicity. 

The consensus amino acid sequences in this region of 
types 1-3 (Figure 6b) were used to define three series of 
82 nonameric oligopeptides overlapping by eight of the 
nine residues with those before and after in the series 
(Figure 7a-c) . These were synthesised on a 12 x 8 arrays 
of polypropylene pins as described in Methods. Antibody 
reactivity to the immobilised antigens on the pins was 
determined by indirect ELISA, using an overnight 
incubation with a 1/40 dilution of test serum overnight at 
4®C, followed by washing, and detection with an anti-human 
IgG-peroxidase conjugate and appropriate substrate (see 
Methods) . 

Reactivity of an anti-HCV negative, PCR-negative 
donor, with no Jcnown risk factors for HCV infection with 
the three series of peptides was determined. No 
significant reactivity is shown with any of the 
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HCT-encoded oligopeptides- Reactivity of sera from three 
donors infected with HCV type 3 to each of the 
oligopeptides is shown in Figures 9a-9c. All three sera 
reacted with peptides ranging from No. 13 (sequence 
KPALVPDKE; Figure 7) to Ho» 22 (sequence VLYQQYDEM) in the 
first antigenic region, although the precise peptides 
recognised varied slightly between individuals. All three 
sera reacted to varying extents with a second antigenic 
region, lying in the range from oligopeptides 32 to 42 (of 
sequence ECSOAAPYI to QAQVIAHWQ) . Weater and more 
variable reactivity was observed to peptides 48 to 53. 
Finally, significant reactivity was also observed to 
single oligonucleotides 2 (2 of 3 samples), 61 (2 of 3), 
66 (3 of 3), 73 (3 of 3) and 80 (2 of 3). 

The sequences of the major antigenic regions of HCV 
type 3 differ considerably from those encoded by any of 
the type 1 or type 2 variants. The region bounded by 
peptides 13 to 22 shows average homologies of 50% with HCV 
type 2 variants and 67% with type 1- Between peptides 32 
to 42, there are homologies of 39% with type 2 and 58% 
with type 1 variants. Thus, although similar regions of 
each NS-4 sequence are antigenic, the actual epitopes 
differ considerably between HCV types. 



Discussion 

The NS-4 region of HCV type 3 shows considerable 
sequence divergence from other variants of HCV, that 
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exceeds that found in the core, NS-3 or NS-5 regions 
previously analysed (Chan et al, 1992). The function of 
the protein encoded by this region of the HCV genome is 
unlcnown, and the consequences of this variability on virxis 
replication and pathogenesis are unknown. The function of 
the NS-4 region in flaviviruses and pestiviruses is also 
poorly defined. 

The degree of amino acid sequence varieUsility, and 
the nature of the amino acid substitutions indicate that 
the major sites of antibody reactivity are also those of 
antigenic variability. This undoubtedly underlies the 
restricted cross -reactivity of HCV type 1 NS-4 encoded 
antigens with sera from individuals infected with 
different HCV types. Serological diagnosis of infection 
is currently based entirely on recombinant or synthetic 
oligopeptide sequences derived ultimately from HCV type 1 
sequences (Choo et al., 1991). The serological response 
to infection is often very restricted in its initial 
stages, with antibody to only one of the recombinant 
antigens used for screening. Not only does this present 
difficulties with supplementary antibody tests, where 
reactivity to two HCV-encoded antigens is required for 
confirmation, but can lead to an increased probability of 
failing to detect early infection with HCV types 2 and 3. 

Table 7 relates HCV typing determined by PCR to 
results obtained using type-specific antigens (TSA) and 
shows good correlation for HCVl-3 types. 
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TABLES 

SEQUENCES OF NS-4 ENCODED ANTIGENS FOR (A) IMPROVED 

SER0UXjK:AL diagnosis, and (B) FOR SEROLOGICAL 
DISCRIMINATTON OF INFECTION WITH DIFFERENT HCV TYPES 



A) 
Type 

3 

n 



B) 

Type 



Regun 1 ati91-i708)* 

KPALVPDKEVLYQQYDEMt 

RVWTPDKEILYEAFDEM 
E^VIAPDKEVLYEAFDEM 

Regfon 1 a691-1708) 

KPALVPDKEVLYQQYDEM 

RVWTPDKEILYEAFDEM 
RAVIAPDKEVLYEAFDEM 

KPAUPDEUEVLYREFDEM 
KPAVIPDREVLYREFDEM 



Regicm 2 a710-1728) 

ECSQAAPYIEQAQVIAHQF 

ECASKAAUEEGQRMAEML 
ECASI^AALIEEGQRIAEML 

Regkm 2 a710-1728) 

ECSQAAPYIEQAQVIAHQF 

ECASKAAUEEGQRMAEML 
ECASRAAUEEGQRIAEML 

ECSQHLPYIEGMLAEQF 
ECSQHLPYIEGALAEQF 



* Amino acid pasitiaDS mimbeied as in Qioo et aU (1991). 

t Amino acMl codes: A: niawimy R: areinine: N; aspaiagxnc: D: aspartic add: C: q^stcine: Q: 
g tptarmiw^ E: ^mainic add: G: gtycinc: H: histidine: I: isolcudnc: L: lendnc; K; lysine: M: 
mrrtrirwwn''- F: pbenylalamnc: P: proline: S: serine: T: threonine: W: tryptophan: Y: lyrosine: 
V: valine. 

t Altemarive peptides, where there is vazxabiii^ witilin an HCV type» 
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TABLE 6 

SEQUENCES OF OUGONnJCLEOrtDES SUITABLE FOR DIRECT DETECTION OF 
HCV TYPE 3 IN CLINICAL SPECIMENS BY POLYMERASE CHAIN REACTION 



Nne Rc|^ Poation Poit Seqitciiccs S*'3't 

ofSniase* 



007 NS-4 5293 - AACTCGAGTATCCCACTGATGAACnTCCACAT 

220 NS-4 5278 - CACATGTGCTTCGCCCAGAA 

TS-3a NS-4 5140 + GCCGCCCCATATATCGAACA 

TS-3b NS^ 5161 + GCTCAGGTAATAGCCCACCA 

Type!: 

TS-2a NS-4 5140 + AAAGCCGCCCTCATTGAGGA 

TS-2b NS-4 5161 + GGGCAGCGGATGGCGGAGAT 

TS-la NS-4 5140 + CACTTACCGTACATCGAGCA 

TS-lb NS-4 5161 + GGGATGATGCTCGCCGAGCA 



• Positum of 5'base relative to HCV genomic sequence in Choo « aL, (1991). 

t Q rirntnrion of primer segncnce {-*•: sense: -: ami-sense) 

$ Abfaieviaiions: A: nrtminr C: cytidine: G: goamdine. T: thymidine. 

Y type-specific sense primers for amplification of HCV types 3. 2 and 1 variants. 
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TABLE 7 

COMPARISON OF SEROLOGICAL TYPING BY HCV-TSA WITH PCR 



Nnrobcr TYPE-SPECinC ANTIBODY 

PCR* tested 12 3 U2 1+3 M NTS' NR ^ 

1 57 63 , - . 1 - 3 3 

2 12 . II - - - 1 1 0 

3 47 1 . 45 - 2 - 4 4 

27 11 - 4 1 4 - 3 4 



■Genotype of IKTV sequences amplified by PCR and 1992) 
NTS : No type-specific andbody detected 

* NR : nmncactive widi NS-4 peptides 

* Samples fiom HCVrinfiscted haemophiliacs, nn-iypcd by PCR. 
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PART IV IDENTIFICATION OF HCV TYPE -4 

Introduction 

Investigations were carried out on sequence 
variations in the 5' non-coding region (5 'NCR) of HCV 
samples from a variety of worldwide geographical locations 
(Figure 13), and also in the core region (Figures 15A and 
15B) , Phylogenetic analysis (Figures 14 and 16) revealed 
a new distinct HCV type which we refer to herein as HCV-4. 

Methods 

Samples . RNA was extracted from plasma samples that were 
repeatedly reactive on second generation screening assays 
for HCV, and which were either confirmed (significant 
reactivity with two or more antigens in the Chiron 
recombinant immunoblot assay; Chiron Corporation, 
Emeryville, California, USA) or indeterminate (reactive 
with only one antigen) from blood donors and patients with 
NANBH. Most of the samples containing sequences that 
differed substantially from known HCV types came from 
Egypt (EG 1-33). others came from Holland (NL-26) , Hong 
Kong (HK 1-4), Iraq (IQ-48) and XX (xx-(6) . 

Sequence determination . HCV sequences were reverse 
transcribed and amplified with primers matching conserved 
regions in the 5 'NCR as previously described [1]. For 
analysis of the core region, RNA was reverse transcribed 
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using a primer of sequence CACT/C)GTCA/G)AGGCTATCGATGAC 
(5' base: xxx, numbered as in [20]). cDNA was amplified 
using this primer and a primer in the 5 'NCR of sequence 
ACTGCCTGATAGGGTGCTTGCGAG (5' base: -54). The second PGR 
used primers of sequences AGGTCTCGTAGACCGTGCATCATG (5' 
base: -21) and TTGCG (G/T/C) GACCT ( VT) CGCCGGGGGTC 
(5 'base: xxac) . An^lified DKA in both regions was directly 
sequenced as described previously (reference la) . 

Sequence analysis. Sequences were aligned using the 
CLDSTAL program in the University of Wisconsin GCG package 

(reference 6) . Phylogenetic trees were constructed by the 
DHAMI. program in the PHYLIP package of Felsenstein 

(version 3.4, June 1991; (reference 9), using the global 
option. RNA secondary structures in the 5 'NCR of 4 
representative HCV variants (refs) were predicted using 
the program FOLD* Three predictions were made from each 
sequence between nucleotides -341 to -1, -341 to +300, and 
-341 to +900 to allow for possible long zremge interactions - 
Coii$)arison of the predicted conformations for each 
sequence over the different lengths showed that only 
relatively small scale features, such as the stem/loop 
analysed in the Results were at all conserved (data not 
shown) . 

All sequences reported in this part have been 
submitted to GenBank. 
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Results 

Diverqen-t 5^ NCR sequences. Several sequences in the 5'NC 
^ region detected in samples of blood donors from Saudi 

Arabia, Holland cind Hong Kong, and from NANBH patients in 
Iraq and xxx differed substantially from those found in 
Scottish blood donors and those reported elsewhere (Figure 
13) . Instead of shoving the well characterised nucleotide 
substitutions that distinguish HCV types 1, 2 and 3 from 
each other, a new set of sequence differences were 
observed in the new variants that appeared to place them 
outside the existing system of virus classification. This 
can be more simply represented by reconstructing a 
phylogeny of the sequences and presenting the results as 
an evolutionary tree (Figiire 14) . This analysis confirms 
that sequences 1-10 cluster separately from the varieints 
previously typed as 1,2 and 3. For convenience we will 
refer to sequences within this new group as type 
HCV type 4. Mean distances within type 4 and between type 
4 and the other HCV types in the 5 'NCR were compareUt>le to 
those previously described for type 1-3. Although 
sequences within type 4 are relatively closely grouped, 
sequences 11, 12 and 13 differ considerably from any of 
the known types. 

c Using this phylogenetic tree, it can be seen that the 

majority of previously published 5 'NCR sequences can be 
readily identified as types 1, 2 or 3. Furthermore, 
almost all of the sequences from Zaire (shown as hollow 
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squares) cluster closely within type 4, suggesting a wider 
distribution in Africa. However, a further complication 
is that three identical sequences obtained from South 
African patients appeared distinct from both the type 1 
and the type 4 group, and may represent yet another HCV 
type- 

RNA from three representative type 4 variants (Eg 29, 
33, 21; corresponding to 5 'NCR sequences nos. 1-3) was 
aii?>lified using primers in the core region of HCT 
polyprotein. All three sequences differed considerably at 
both the nucleotide and amino acid level from HCV types 1 
to 3 (Figure 15A/B) . Phylogenetic analysis of these 
sequences and those previoxisly analysed indicated that 
they formed a separate, relatively homogeneous group 
distinct from the other types (Figure 16) . Reconstructed 
nucleotide distances between type 4 and types 1-3 were 
comparable to those that exist between the three known HCV 
types of HCV. Althou^ most of the nucleotide sequence 
differences were silent, there were between 4 and 9 amino 
acid differences between the new variants and other types. 
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PART V HCV TYPING 
Introduction 

In view of the sequence variations between HCV types 
1,2,3 and 4 differences in restriction enzyme cleavage sites 
exist, leading to different endonuclease cleavage patterns. 
This technique was used to identify HCV genotypes in blood 
samples from a variety of sources worldwide, 

(A) Tvpincr of HCVl-3 
METHODS 

Serum Samples ; Samples from blood donors in six countries, 
Scotland, Finland, Netherlands, Hong Kong and Australia and 
Japan, were available from routine 2nd Generation anti-HCV 
EXiISA screening (Ortho or Abbott) . Donor samples that were 
repeatedly reactive in the above tests were further 
investigated using a supplementary test (Ortho RIBA: 
Finland, Netherlands, Australia, Egypt, Abbott Matrix: Hong 
Kong) or samples were titred for anti-HCV by ELISA (Japan) . 
Samples that were positive (significant reactivity with two 
or more HCV antigens (1+ to 4+) or indeterminate (reactivity 
with one antigen only) in the RIBA test or had a titre of > 
X 4096 by ELISA (Japan only) were tested for viral RNA by 
Polymerase Chain Reaction (PCR) . 

RNA PCR ; PGR for the detection of HCV RNA was carried out as 
previously described by Chan et al (reference la) using 
primers in the 5 'non-coding region (5 'NCR) in a nested PCR, 
with primers 209/939 and 211/940(5) in first and second 
reactions respectively. 
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HCT TYPING 

The existence of relatively conserved patterns of 
substitutions in the 5'NCR that are characteristic of 
different HCT types provide useful signature sequences for 
identification of HCV genotypes. Having compared large 
numbers of different HCV type 1, 2 and 3 sequences, we 
developed a method that differentiated HCV types 1-3 by 
restriction endonuclease cleavage of amplified DNA. 
However, the 19 type 4 sequences would appear as type 1 
(electrophoretic types Aa and Ab) , and for concurrent 
studies it has been necessary to modify the conditions to 
identify the new HCV type. All type 4 sequences showed a 
T — >C change at position -167 that creates a novel Hinfl 
site that is absent in all type 1 (and type 2) sequences. 
In combination with ScrFI, and Haelll/Rsal, it has now 
proved possible to identify the new type reliably in 
numerous coxmtries in the Middle East and elsewhere. 



RESULTS 

The results are summarised in Table 8 for HCV types 
1, 2 and 3. The Egyption samples gave abberant 
restriction patterns on the single ScrFI digest and were 
identified as type 4, 
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TABLE 8 



PREVALENCE OF HCV TYPES IN DIFFERENT COUNTRIES 



COUNTRY 



HCV TYPES f%) 
HCV-1 HCV-2 HCV-3 



Scotlcind 



86(51%) 21(13%) 60(36%) 



Finland 



3(25%) 5(42%) 4(33%) 



Netherlands 



18(60%) 7(23%) 



5(17%) 



Hong Kong 



22(63%) 0(0%) 



0(0%) 



Australia 



13(57%) 3(13%) 7(30%) 



Japan 



31(77%) 9(23%) 0(0%) 



Egypt 



0(0%) 0(0%) 



0(0%) 
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(B) MODIFICATION OF PCR-BaSED TYPING ASSAY TO DETECT 
- TNFECTION WTTH HCV TYPT^ A TN CUNTCAL SPECIMENS 



Methods 

^^rrtlon of RNA > RNA was extracted from 100 ul aliquots 
of plasma of non-A, non-B patients by addition of 1 ml 
RHAzol solution (2M guanidinium thiocyanate, 12.5 mM 
sodium citrate [pH7.03, 0.25% w/v N-lauroylsarcosine, 
0.05M 2-mercaptoethanol, lOOmM sodium acetate [pH 4.0], 
50% w/v water saturated phenol) as previously described 
(Chamczynski et al. 1987), and mixed until precipitate 
dissolved. After addition of 100 ul chloroform, each 
sample was spun for 5 minutes at 14000 x g and the aqueous 
phase re-extracted with 0.5 ml chloroform. UNA was 
precipitated by addition of an equal volume of isopropanol 
and incubation at -20OC for at least 1 hour. An RKA 
pellet was produced by centrifugation at 14000 x g for 15 
minutes at 4OC, washed in iml 70% cold ethanol solution, 
dried and resuspended in 20 ul diethylpyrocarbonate 
treated distilled water. Of the 100 directly extracted 
swplesr a total of 19 were PCR-negative (see below). Two 
ml volumes of the negative samples were ultracentrifuged 
at 200 000 X g for 2 hours and the pellet re-extracted as 
described above. Extraction from the larger volume of 
plasma yielded an additional 3 positive sauries (numbers 
66, 80, 85). 
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PCR and typincr . RNA was reverse trsoiscribed with primer 
940 and cDNA amplified in a two stage nested PCR reaction 
with primers 940/939, followed by 209/211 as previously 
described (Chcm et al. 1992). PCR product was 
radiolabelled with [^^SJ-dATP analysed by restriction 
endonuclease cleavage (HcOmish et al. Transfusion, 
32:no.ll 1992). Samples were cleaved with ScrFI and a 
combination of Haelll/Rsal in two separate reactions to 
identify HCV types 1/4, 2, 3. Figure 17 shows 
endonuclease cleavage patterns. HCV types 1 and 4 were 
differentiated by a third reaction with Hinfl (see 
Results) . Two samples yielded restriction patterns that 
were different from those of the four known types of HCV 
cind were analysed further by direct sequence analysis of 
the amplified DNA (Chan et al. 1992) . These two samples 
contained 5 'NCR sequences distinct from those of known HCV 
types and currently remain unclassified. 

RESULTS 

Modification of RFLP method to identify HCV type 4. 
Previous sequence analysis in the 5 'NCR of HCV amplified 
from plasma of Egyptian blood donors revealed a relatively 
homogeneous group of novel sequence variemts in both the 
5 'NCR and core region which were as distinct from HCV 
types 1, 2 and 3 as these latter types were from each 
other (see previous submission) . This new group was 
designated as HCV type 4. 
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con^arison of cleavage patterns of type 4 sequences 
with those of type RFLP analysis of the previously 
identified type 4 sequences produced a distribution of 
electropherotypes with ScrFI and Haelll/Rsal similar to 
that HCV type 1 (Table 9) . Type 1 sequences yielded 9 
patterns of aA/Br 35 of hA/B and 1 bC. With these enzymes 
alone, type 4 sequences were thus indistinguishable from 
type 1 (14 aA/B, 4 hk/B) • However, type 1 and type 4 
sequences consistently differ in the number of Hinfl 
sites. ATT 18 type 4 sequences contain one or two 
potential cleavage sites (producing patterns band c; table 
5) while none are found in any of the 45 type 1 sequences 
analysed (pattern a) . One of the type 4 sequences was 
further differentiated from type 1 and other HCV types by 
the loss of a restriction site for Rsal, leading to a new 
pattern of bands designated h (44, 172, 9, 26; first 
coltnnn. Table 9) . Finally, a single sequence, EG-28 lost 
two sites to produce bemds of 216, 9, and 26 bps (pattern 
i; Table 9) • This sequence was distinct from that of any 
of the known HCV types (including type 4) and is shown in 
the table in the column labelled XT (unclassified) 

Typing of study subjects. HNA was esctracted from 100 
saatples of patients with NANB hepatitis and amplified with 
primers in the 5 'NCR. Of these, 84 were PCR positive, and 
enabled HCV typing to be carried out by RPXP. This was 
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initially carried out with Haelll/Rsal and ScrFl, and 
allowed the identification of 10 type 2 and 10 type 3 
variants (Table 10) . Samples showing electrophoretic 
patterns ak/B or bA/B were further analysed by cleavage 
with hinfl, yielding 38 samples with pattern a, thus 
identified as type 1, 22 with pattern b and 2 with pattern 
c, both identified as type 4. Finally, two samples showed 
the unusual cleavage patterns h and i with Haelll/Rsal and 
pattern b with Hinfl, and were therefore directly 
sequenced. These two sequences were similar to each other 
but were unlike any of the known HCV types, and also 
distinct from EG-28, the other sequence showing pattern i 
with Haelll/Rsal (Table 10) . As they cannot be currently 
classified, they will be referred to as type U. 
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TABLE 9 

PREDICTED CLEAVAGE PATTERNS OF PUBLISHED 5'NCR SEQUENCES 
OF HCV TYPES U 2. 3 AND 4 WTHi RsdUHadSL ScrFl AND Hitfi 



PrcdictBd deavagepa 


[ttcnr* 




HCV type 






















/Kiol 


ScrFT 


Hittfl 


I 


2 


3 


4 




a 


AfB 




9 


- 


- 






b 


Am 


a 


35 


- 


- 






b 


C 


a 


1 






_ 


_ 


a 


A^ 


b** 


~ 


- 


- 






a 


A/B 






- 


- 


1 




b 


A/B 


b 








4 


_ 


c 


D 


a 




5 








d 


D 


a 


_ 


1 




- 


- 


d 


E 


a 




2 








e 


D 


a 




1 










E 


a 




I 








f 


G 


b 






14 






f 


G 


c 






1 






g 


G 


b 






8 








A/B 


b 








1 




? 


A/B 


b 










I 



• Cleavage pancms tff^i g"'^'^ for HaenURsdi and ScrFl as described previously (McOmish 
etaL 1992). 

HCV variant of imdesignrd qrpe 

* Fanem a : nndcaved by fln^ 

<i Pattern b:DNA cleaved CO gcnetaie two fragments of sizes 107 and 142 bps (in order 5'-> 
3') 

• Panem c : DNA deaved to generate three fta g mems of 56. 51 and 142 bps 
^Ncwdeavage pattern for gaenKRsaldesignaTrri fa (bands of 44 bps, 172 bps. 9 bps. 26 bps) 

* New cfeavage pattern for HaeWRsdl dfsignaTffd i (216 bps. 9 bps. 26 bps) 
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TABLE 10 

IDENTinCATlON OF HCV TYPES M .N STUDY SUBJECTS BY RFLP 
ANALYSIS OF 5'NCR SEQUENCES WITH Rsal/HaeUl, ScrFl AND Hu^ 





Inferred HCV type 




Baem 
















Sietrl 


KMWjk 


1 2 


3 


4 


a 


A/B 


a 


2 


- 


• 




b 


A/B 


a 








_ 


a 


A/B 


b 






16 




a 


AJB 


c 


- 


- 


2 




b 


A/B 


b 






o 


- 


c 


D 


n.d. 


7 








d 


E 


iLd. 


3 








f 


G 


iLd 




7 






S 


G 


xuL 




3 






h 


Am 


b 








1 


i 


Am 


b 








1 






TOTALS 


38 10 


10 


24 


2 



• Two samples yielded unusual restriction pancrns with HaeJWRsdl (h, i). Sequence analysis 
of tbe rNCR placed them outside existing HCV classification (samples IQ^, EG-96). 
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PART VI Expression and As say etc. Techniques 

The present invention also provides eacpression 
vectors containing the DNA sequences as herein defined, 
^rtiich vectors being capable, in an appropriate host, of 
eacpressing the DHA sequence to produce the peptides as 
defined herein. 

The expression vector normally contains control 
elements of DNA that effect expression of the DNA sequence 
In an appropriate host. These elements may vary according 
to the host but usually incltide a promoter, ribosome 
binding site, translational start and stop sites, and a 
transcriptional termination site. E x a mp les of such 
vectors include plasmids and viruses. Eacpression vectors 
of the present invention encon^ass both extrachromosomal 
vectors and vectors that are integrated into the host 
cell's chromosome. For use in E>coli, the expression 
vector may contain the DNA sequence of the present 
invention optionally as a fusion linked to either the 5'- 
or 3 '-end of the DNA sequence encoding, for example, 
B-galactosidase or to the 3 '-end of the DNA sequence 
encoding, for example, the trp E gene. For use in the 
insect baculoviriis (AcNFV) system, the DHA sequence is 
optionally fused to the polyhedrin coding sequence. 

The present invention also provides a host cell 
transformed with ea^ression vectors as herein defined. 

Eacan^les of host cells of use with the present 
invention include pro3caryotic and eu3caryotic cells, such 
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as bacterial, yeast, mammalian and insect cells. 
Particular examples of such cells are E.coli, 
S.cerevisiae, P.pastoris, Chinese hamster ovary and mouse 
cells, and Spodoptera fruqiperda and Tricoplusia ni. The 
choice of host cell may depend on a number of factors but, 
if post*translational modification of the HCV viral 
peptide is important, then an euXaryotic host would be 
preferred. 

The present invention also provides a process for 
preparing a peptide as defined herein which comprises 
isolating the DNA sequence, as herein defined, from the 
HCV genome, or synthesising DNA sequence encoding the 
peptides as defined herein, or generating a DNA sequence 
encoding the peptide, inserting the DNA sequence into an 
expression vector such that it is capable, in an 
appropriate host, of being expressed, transforming host 
cells with the expression vector, culturing the 
transformed host cells, and isolating the peptide. 

The DNA sequence encoding the peptide may be 
synthesised using standard procedures (Gait, 
Oligonucleotide Synthesis; A Practical Approach > 1984, 
Oxford, IKL Press) . 

The desired DNA sequence obtained as described above 
may be inserted into an expression vector using known and 
standard techniques. The expression vector is normally 
cut using restriction enzymes and the DNA sequence 
inserted using blunt-end or staggered-end ligation. The 
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cat is usually made at a restriction site in a convenient 
position in the expression vector such that, once 
inserted, the DKA sequences are under the control of the 
functional elements of DNA that effect its expression. 

Transformation of an host cell may be carried out 
using standard techniques. Some phenotypic marJcer is 
usually employed to distinguish between the transformants 
that have successfully taJcen up the expression vector and 
those that have not. culturing of the transformed host 
r^cij and isolation of the peptide as required may also be 
carried out using standard techniques. 

The peptides of the present invention may be prepared 
hy synthetic methods or by recombinant DKA technology. 
The peptides are preferably synthesized using automatic 
synthesizers. 

Antibody specific to a p^ide of the present 
invention can be raised using the peptide. The antibody 
may be polyclonal or monoclonal. The antibody may be used 
in quality control testing of batches of the peptides; 
purification of a peptide or viral lysate; epitope 
jnapping; when labelled, as a conjugate in a con^titive 
type assay, for antibody detection? and in antigen 
detection assays. 

Polyclonal antibody against a peptide of the present 
invention may be obtained by injecting a peptide, 
optionally coupled to a carrier to promote an immune 
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response, into a mamnalian host, such as a mouse, rat, 
sheep or rabbit, and recovering the antibody thus 
produced. The peptide is generally administered in the 
form of an injectable formulation in which the peptide is 
admixed with a physiologically acceptable diluent. 
Adjuvemts, such as Freund's complete adjuvemt (FCA) or 
Freund's incomplete adjuvant (FXA) , may be included in the 
formulation. The formulation is normally injected into 
the host over a suitable period of time, plasma samples 
being taken at appropriate intervals for assay for 
anti-HCV viral antibody. When an appropriate level of 
activity is obtained, the host is bled. Antibody is then 
extracted and purified from the blood plasma using 
standard procedures, for example, by protein A or 
ion-exchange chromatography. 

Monoclonal antibody against a peptide of the present 
invention may be obtained by fusing cells of an 
immortalising cell line with cells which produce antibody 
against the viral or topographically related peptide, and 
culturing the fused immortalised cell line. Typically, a 
non-human mamnalian host, such as a mouse or rat, is 
inoculated with the peptide. After sufficient time has 
elapsed for the host to mount an antibody response, 
antibody producing cells, such as the splenocytes, are 
removed. Cells of an Immortalising cell line, such as a 
mouse or rat myeloma cell line, are fused with the 
antibody producing cells and the resulting fusions 
screened to identify a cell line, such as a hybridoma. 
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that secretes the desired monoclonaX antibody. The fused 
cell line may be ciiltured and the monoclonal antibody 
purified from the culture media in a similar manner to the 
purification of polyclonal antibody. 

Diagnostic assays beised upon the present invention 
may be used to determine the presence or absence of HCV 
infection. They may also be used to monitor treatment of 
such infection, for example in interferon therapy. 

In an assay for the diagnosis of viral infection, 
there are basically three distinct approaches that can be 
adopted involving the detection of viral nucleic acid, 
viral antigen or viral antibody. Viral nucleic acid is 
generally regarded as the best indicator of the presence 
of the virus itself and would identify materials likely to 
be infectious. However, the detection of nucleic acid is 
not usually as straightforward eis the detection of 
euxtigens or antibodies since the level of target can be 
very low. Viral antigen is used as a marlcer for the 
presence of virus and as an indicator of inf ectivity. 
Depending t^on the virus, the euaount of antigen present in 
a san^le "^^n be very low and difficult to detect. 
Antibody detection is relatively straightforward because, 
in effect, the host immune system is amplifying the 
response to an infection Jay producing large amounts of 
circulating antibody- The nature of the antibody response 
can often be clinically useful, for example IgM rather 
than IgG class antibodies are indicative of a recent 
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infection, r the response to a particular viral cuitigen 
may be associated with clearance of the virus. Thus the 
exact approach adopted for the diagnosis of a viral 
infection depends upon the particular circumstances and 
the information sought. In the case of HCV, a diagnostic 
assay may embody any one of these three approaches. 

In an assay for the diagnosis of HCV involving 
detection of viral nucleic acid, the method may comprise 
hybridising viral RNA present in a test sample, or cDNA 
synthesised from such viral RNA, with a DNA sequence 
corresponding to the nucleotide sequences of the present 
invention or encoding a peptide of the invention, and 
screening the resulting nucleic acid hybrids to identify 
any HCV viral nucleic acid. The application of this 
method is usually restricted to a test sample of cin 
appropriate tissue, such as a liver biopsy, in which the 
viral RNA is likely to be present at a high level. The 
DNA sequence corresponding to a nucleotide sequence of the 
present invention or encoding a peptide of the invention 
may take the form of an oligonucleotide or a cDNA sequence 
optionally contained within a plasmid. Screening of the 
nucleic acid hybrids is preferably carried out by using a 
labelled DNA sequence. Preferably the peptide of the 
present invention is part of an oligonucleotide %rtierein 
the label is situated at a sufficient distance from the 
peptide so that binding of the peptide to the viral 
nucleic acid is not interfered with by virtue of the label 



wo 93/10239 



PCr/GB92/02143 



- 62 - 

being too close to the binding site. One or more 
additional rounds of screening of one kind or another may 
be carried out to characterise further the hybrids and 
thus identify any HCV viral nucleic acid. The steps of 
h^ridisation and screening are carried out in accordance 
with procedures known in the art. 

The present invention also provides a test kit for 
the detection of HCV viral nucleic acid, which comprises 

i) a labelled oligonucleotide comprising a DNA sequence 
of the present invention or encoding a peptide of the 
present invention; and 

ii) washing solutions, reaction buffers and a substrate, 
if the label is an enzyme. 

Advantageously, the test kit also contains a positive 
control san^le to facilitate in the identification of 
viral nucleic acid. 

In an assay for the diagnosis of HCV involving 
detection of viral antigen or antibody, the method may 
comprise contacting a test sample with a peptide of the 
present invention or a polyclonal or monoclonal antibody 
against the peptide and determining whether there is any 
antigen-antibody binding contained within the test sample. 
For ^-Tii'g purpose, a test Jcit may be provided cos^rising a 
peptide, as defined herein, or a polyclonal or monoclonal 
antibody thereto and means for determining whether there 
is my binding with antibody or antigen respectively 
contained in the test sample- The test sample may be 
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taken from any of the appr priate tissues euid 
physiological fluids mentioned above for the detection of 
viral nucleic acid. If a physiological fluid is obtained, 
it may optionally be concentrated for any viral cuitigen or 
antibody present. 

A variety of assay formats may be employed. The 
peptide can be used to capture selectively antibody 
against HCV from solution, to label selectively the 
antibody already captured, or both to capture and leUoel 
the antibody. In addition, the peptide may be used in a 
variety of homogeneous assay formats in vhich the cintibody 
reactive with the peptide is detected in solution with no 
separation of phases. 

The types of assay in which the peptide is used to 
capture antibody from solution involve immobilization of 
the peptide on to a solid surface. This -surface should be 
capable of being washed in some way. Examples of suitable 
surfaces include polymers or various types (moulded into 
microtitre wells; beads; dipsticks of various types; 
aspiration tips; electrodes; and optical devices), 
particles (for example latex; stabilized red blood cells; 
bacterial or fungal cells; spores; gold or other metallic 
or metal-containing sols; and proteinaceous colloids) with 
the usual size of the particle being from 0.02 to 5 
microns, membranes (for example of nitrocellulose; paper; 
cellulose acetate; and high porosity/high surface area 
membranes of an organic or inorganic material) . 
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The attachment of the peptide to the surface can be 
by passive adsoption from a solution of optimum 
composition which may include surfactants, solvents, salts 
and/or chaotropes; or by active chemical bonding. Active 
bonding may be through a variety of reactive or 
activatible functional groups ^ich may be exposed on the 
surface (for exan^le condensing agents? active acid 
esters, halides and anhydrides; amino, hydroxyl, or 
carboxyl groups; sulphydryl groups; carbonyl groups; 
diazo groups; or unsaturated groups)- Optionally, the 
active bonding may be through a protein (itself attached 
to the surface passively or through active bonding) , such 
as T^1>-*'im^T. or casein, to which the viral peptide may be 
chemically bonded by any of a variety of methods. The use 
of a protein in this way may confer advantages because of 
isoelectric point, charge, hydrophilicity or other 
physico-chemical property. The viral peptide may also be 
attached to the surface (usually but not necessarily a 
membrane) following electrophorectic separation of a 
reaction mixture, such as immunoprecipitation- 

After contacting (reacting) the surface bearing the 
peptide with a test sample, allowing time for reaction, 
and, vrtiere necessary, removing the excess of the sample by 
any of a variety of means, (such as washing, 
centrifugation, filtration, magnetism or capilliary 
action) the captured antibody is detected by any means 
which will give a detectable signal. For example, this 
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may be achieved by use of labelled molecule or particle as 
described above which will react with the captured 
antibody (for example protein A or protein G and the like; 
anti-species or anti-immunoglobulin-sub-type; rheumatoid 
factor; or antibody to the peptide, used in a competitive 
or blocking fashion) , or any molecule containing an 
epitope contained in the peptide. 

The detectable signal may be optical or radioactive 
or physico-chemical and may be provided directly by 
labelling the molecule or particle with, for example, a 
dye, radiolabel, electroactive species, magnetically 
resonant species or fluorophore, or indirectly by 
labelling the molecule or particle with an enzyme itself 
capeUdle of giving rise to a measurable change of any 
sort. Alternatively the detectable signal may be obtained 
using, for example, agglutination, or through a 
diffraction or birefringent effect if the surface is in 
the form of particles. 

Assays in which a peptide itself is used to label an 
already captured antibody require some form of labelling 
of the peptide which will allow it to be detected. The 
labelling may be direct by chemically or passively 
attaching for example a radio label, magnetic resonant 
species, particle or enzyme label to the peptide; or 
indirect by attaching any form of label to a molecule 
which will itself react with the peptide. The chemistry 
of bonding a label to the peptide can be directly through 
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a moiety already present in the peptide, such as an amino 
group, or through an intermediate moiety, such as a 
maleimide group. Capture of the antibody may be on any ol 
the surfaces already mentioned by any reagent including 
passive or activated adsorption ^rtiich will result in 
specific antibody or immune complexes being bound. In 
particular, capture of the antibody could be by 
anti-species or anti-immunoglobulin-sub-type, by 
rheumatoid factor, proteins A, G and the like, or by any 
molecule containing an epitope contained in the peptide- 

The labelled peptide may be used in a competitive 
binding fashion in which its binding to any specific 
molecule on any of the surfaces exemplified above is 
blocked by antigen in the sample. Alternatively, it may 
be used in a non-con^etitive fashion in which antigen in 
the san^le is bound specifically or non-specif ically to 
any of the surfaces above and is also bound to a specific 
bi- or polyvalent molecule (e.g. an antibody) with the 
remaining valencies being used to capture the labelled 
peptide. 

Often in homogeneous assays the peptide and an 
antibody are separately labelled so that, when the 
antibody reacts with the recombinant p^tide in free 
solution, the two labels interact to allow, for example, 
non-radiative transfer of energy captured by one label to 
the other label with appropriate detection of the excited 
second label or quenched first label (e.g. by fluorimetry. 
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magnetic resonance or enzyme measurement) . Addition of 
either viral peptide or antibody in a sample results in 
restriction of the interaction of the labelled pair and 
thus in a different level of signal in the detector. 

A suitable assay format for detecting HCV antibody is 
the direct sandwich enzyme immunoassay (EIA) format. A 
peptide is coated onto microtitre veils. A test sample 
and a peptide to which an enzyme is coupled are added 
simultaneously. Any HCV antibody present in the test 
sample binds both to the peptide coating the well and to 
the enzyme-coupled peptide. Typically, the same peptide 
are used on both sides of the sandwich. After washing, 
bound enzyme is detected using a specific substrate 
involving a colour change. A test kit for use in such an 
EIA comprises: 

(1) a peptide, as herein defined labelled with an enzyme; 

(2) a substrate for the enzyme; 

(3) means providing a surface on which a peptide is 
Immobilised; and 

(4) optionally, washing solutions and/or buffers. 

It is also possible to use IgG/IgM antibody capture 
ELISA wherein an antihuman antibody is coated onto 
microlitre wells, a test sample is added to the well. Any 
IgG or IgM antibody present in the test sample will then 
bind to the anti-humem antibody. A peptide of the present 
invention, which has been labelled, is added to the well 
and the peptide will bind to any IgG or IgM antibody which 
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has resulted due to infection by HCT. The IgG or IgM 
antibody can be visualized by virtue of the label on the 
peptide. 

It can thus be seen that the peptides of the present 
invention may be used for the detection of HCV infection 
in many formats, namely as free peptides, in assays 
including classic EIJSA, competition EI.ISA, membrane bound 
EIA and immunoprecipitation. Peptide conjugates may be 
used in aa^jlified assays and IgG/IgM antibody capture 
ELISA. 

An assay of the present invention may be used, for 
exan^le, for screening donated blood or for clinical 
purposes, for exan^le, in the detection and monitoring of 
HCT infections- For screening purposes, the preferred 
assay formats are those that can be automated, in 
particular, the microtitre plate format and the bead 
format- For clinical purposes, in addition to such 
formats, those suitable for smaller-scale or for single 
use, for example, latex assays, may also be used- For 
confirmatory assays in screening procedures, antigens may 
be presented on a strip suitable for use in Western or 
other immnnoblotting tests - 

As indicated above, assays used currently to detect 
the presence of anti-HCV antibodies in test samples, 
particularly in screening donated blood, utilise antigenic 
peptides obtained from HIV type 1 only and, as 
demonstrated herein, such antigens do not reliably detect 
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Other HCV genotypes. Accordingly, it is clearly desirable 
to supplement testing for HIV-1 with testing for all other 

« genotypes, for example, types 2, 3 and 4, and also any 

further genotypes that may be discovered. 

To test for a spectrum of genotypes, there may be 
provided a series of assay means each comprising one or 
more antigenic peptides from one genotype of HCV, for 
example, a series of wells in a microtitre plate, or an 
equivalent series using the bead format. Such an assay 
format may be used to determine the genotype of HCV 
present in a sample. Alternatively, or in addition, an 
assay means may comprise antigenic peptides from more than 
one genotype, for example, a microwell or bead may be 
coated with peptides from more than one genotype. 

It has been found advantageous to use more than one 
HCV antigen for testing, in particular, a combination 
comprising at least one antigenic peptide derived from the 
structural region of the genome and at least one antigenic 
peptide derived from the non-structural region, especially 
a combination of a core antigen and at least one antigen 
selected from the NS3, NS4 and NS5 regions. The wells or 
beads may be coated with the antigens individually. It 
has been found advantageous, however, to fuse two or more 

, antigenic peptides as a single polypeptide, preferably as 

a recombinant fusion polypeptide. Advantages of such an 
approach are that the individual antigens can be combined 
in a fixed, predetermined ratio (usually equimolar) and 
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that only a single polypeptide needs to be produced, 
purified and characterised. One or more such fusion 
polypeptides may be used in an assay, if desired in 
addition to one or more unfused peptides. It will be 
predated that there are many possible combinations of 
antigens in a fusion polypeptide, for example, a fusion 
polyp^tide may coii«)rise a desired range of antigens from 
one serotype only, or may comprise antigens from more than 
one serotype. The antigenic peptides from serotypes 2, 3 
and 4 are preferably those described herein. 

To obtain a polypeptide con^trising mnltiple peptide 
antigens, it is preferred to fuse the individual coding 
sequences into a single open reading frame. The fusion 
should, of course, be carried out in such a manner that 
the antigenic activity of each con^onent peptide is not 
significantly con^romised by its position relative to 
another peptide. Particular regard should of course be 
had for the nature of the sequences at the actual junction 
between the peptides- The resulting coding sequence can 
be expressed, for example, as described above in relation 
to recombinant peptides in general. The methods by which 
such a fusion polypeptide can be obtained are known in the 
art, and the production of a recombinant fusion 
polypeptide comprising multiple antigens of a strain of 
nCV type 1 is described in GB-A-2 239 245 
immunoprecipitation. Peptide conjugates may be used in 
amplified assays and igG/IgM antibody capture ELISA- 
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The peptide of the present invention may be 
incorporated into a vaccine formulation for inducing 
immunity to HCV in man. For this purpose the peptide may 
be presented in association with a pharmaceutically 
acceptable carrier. 

For use in a vaccine formulation, the peptide may 
optionally be presented as part of an hepatitis B core 
fusion particle, as described in Clarke et al ( Nature , 
1987, 330 , 381-384), or a polylysine based polymer, as 
described in Tam ( PNAS , 1988, 85, 5409-5413). 
Alternatively, the peptide may optionally be attached to a 
particulate structure, such as lipsomes or ISCOMS. 

Pharmaceutically acceptable carriers include liquid 
media suitable for use as vehicles to introduce the 
peptide into a patient. An example of such liquid media 
is saline solution. The peptide may be dissolved or 
suspended as a solid in the carrier. 

The vaccine formulation may also contain an adjuvant 
for stimulating the immune response and thereby enhancing 
the effect of the vaccine. Examples of adjuvants include 
aliiminium hydroxide and aluminiiim phosphate. 

The vaccine formulation may contain a final 
concentration of peptide in the range from 0.01 to 
5 mg/ml, preferably from 0-03 to 2 mg/ml. The vaccine 
formulation may be incorporaterd into a sterile container, 
which is then sealed and stored at a low temperature, for 
example 4°C, or may be freeze-dried. 
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In order to induce immunity in man to HCV, one or 
more doses of the vaccine formulation may be 
administered. Each dose may be 0.1 to 2 ml, preferably 
0.2 to 1 ml. A method for inducing immunity to HCV in 
man, comprises the administration of an effective amount 
of a vaccine formulation, as hereinbefore defined. 

The present invention also provides the use of a 
peptide as herein defined in the preparation of a vaccine 
for use in the induction of immunity to HCV in man. 

Vaccines of the present invention may be administered 
by any convenient method for the administration of 
vaccines including oral and parenteral (e.g. intravenous, 
subcutaneous or intramuscular) injection- a!he treatment 
may consist of a single dose of vaccine or a plurality of 
doses over a period of time. 
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SEQUENCE LISTING (FIGURE I and la) 



SEQ ID NO 



SEQUENCE TYPE : Nucleotide cDNA sequence 



SEQUENCE LENGTH : 194 base pairs 



STRANDEDNESS : single 



TOPOLOGY 



linear 



MOLECULAR TYPE : genomic DNA from HCV types 1-3 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Bases - 255 to -62 of the 5' non-coding region showing 
variations in cDNA sequence between HCV types 1-3. 
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SEQUENCE LISTING (FIGURE 3} 



SEQ ID NO : 2 

SEQUENCE TYPE : deduced peptide sequence 
SEQUENCE LENGTH : 85 amino acids 



STRANDEDNESS : single 



TOPOLOGY : linear 

MOLECULAR TYPE t HCV peptide 



ORIGINAL SOURCE 



ORGANISM : human blood samples 



USE 



DEPOSIT 



FEATURES 



Amino acids 2648 to 
in peptide sequence 



2732 of NS-5 region showing variations 
between HCV types 1-3. 
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SEQUENCE LISTING (FIGURE 5) 

SEQ ID NO : 3 

SEQUENCE TYPE : deduced peptide sequence 

SEQUENCE LENGTH : 57 amino acids 

STRANDEDNESS : single 

TOPOLOGY : linear 

MOLECULAR TYPE : HCV peptide 

ORIGINAL SOURCE 

ORGANISM : human blood samples 

USE : 
DEPOSIT : 
FEATURES 

Amino acids 1577 to 1633 of NS-3 region showing variations 
in peptide sequence between HCV types 1-3. 
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SEQUENCE LISTING f FIGURE 7) 



SEQ ID NO 



SEQUENCE TYPE : deduced peptide sequence 



SEQUENCE LENGTH : 124 amino acids 



STRANDEDNESS 



single 



TOPOLOGY 



linear 



MOLECULAR TYPE : HCV peptide 



ORIGINAL SOURCE 



ORGANISM 



human blood sainples 



USE 



DEPOSIT 



FEATURES 



Amino acids 5 to 128 of core region showing variations in 
peptide sequence between HCV types 1-3, 
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SEQUENCE LISTING (FIGURE 9a) 

SEQ ID NO : 5 

SEQUENCE TYPE : Nucleotide sequence 

SEQUENCE LENGTH : 3 67 base pairs 

STRANDEDNESS : single 

TOPOLOGY : linear 

MOLECULAR TYPE : HCV genomic DNA 

ORIGINTUii SOURCE 

ORGANISM : human blood samples 

USE : 

DEPOSIT : 
FEATURES 



Bases 4911 to 5277 of NS-4 region of HCV-3 showing 
individual variations and consensus sequence. 
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SEQUENCE LISTING f FIGURE 9b) 



SEQ ID NO : 6 



SEQUENCE TYPE : deduced peptide sequence 



SEQUENCE LENGTH r 128 amino acids 



STRANDEDNESS : single 



TOPOLOGY 



linear 



MOLECOLAR TYPE : HCV genomic DNA 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Amino acids 1638 to 1765 of NS-4 region of HCV-3 showing 
individual variations and consensus sequence. 
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SEQUENCE LISTING (FIGURE 10a and 10b) 

SEQ ID NO : 7 

SEQUENCE TYPE : peptide sequence 

SEQUENCE LENGTH : 90 amino acids 

STRANDEDNESS : single 

TOPOLOGY : linear 

MOLECULAR TYPE : HCV peptide 

ORIGINAL SOURCE 

ORGANISM : human blood samples 

USE : 
DEPOSIT : 
FEATURES 

Amino acids 1679 to 1768 of NS-4 region of HCVl-3 showing 
individual variations and consensus sequence. 
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SE pUENCE LISTING ( FIGDRE lla and 11c) 



SEQ ID NO 



SEQUENCE TYPE : Nonaiaerxc peptide sequences 



SEQUENCE LENGTH : 9 amino acids each 



STRANDEDNESS 



single 



TOPOLOGY 



: linear 



MOLECULAR TYPE : nonameric HCV peptides 



ORIGINAL SOURCE 



ORGANISM 



synthetic 



USE 



epitope mapping 



DEPOSIT 



FEATURES 



Nonapeptides corresponding to NS4 region of HCVl-3 used 
for epitope mapping. 
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SEQUENCE LISTING (FIGURE 13) 



SEQ ID NO 



9 



SEQUENCE TYPE 



Nucleotide cDNA sequences 



SEQUENCE LENGTH 



30, 70 and 32 base pairs 



STRAKDEDNESS 



single 



TOPOLOGY 



linear 



MOLECULAR TYPE : genomic DNA from HCV types 1 to 4 



ORIGINAL SOURCE 



USE ; 
DEPOSIT : 
FEATURES 

Bases -245 to -216; -185 to -116; and -101 to -70 in the 
5 'NCR region of HCV types 1-4. 



ORGANISM 



human blood samples 
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SEQUENCE LISTING (FIGURE ISA and 15B) 



SEQ ID NO : 10 

SEQUENCE TYPE : Nucleotide with derived protein sequence 
SEQUENCE LENGTH : 240 base pairs 

STRANDEDNESS : single 

TOPOLOGY : linear 

MOLECULAR TYPE : genomic DNA 

ORIGINAL SOURCE 

ORGANISM : human blood samples 

USE : 

DEPOSIT : 
FEATURES 

Base pairs 23 - 262 and amino acids 5 to 89 of the core 
region of HCV types 1-4. 
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CLAIMS 

1. A polynucleotide sequence unique to hepati-tis C virus 
type 3 or 4 (HCV-3 or HCV-4). 

2. A polynucleotide sequence according to claim 1 which 
is a cDNA sequence. 

3. An HCV-3 or HCV-4 specific polynucleotide sequence 
from the core, NS3, NS4 or NS5 regions. 

4. An HCV-3 or HCV-4 specific polynucleotide sequence 
comprised within the sec[uences shown in Figures 1 and la, 
9a, 13 and 15a. 

5. An HCV-3 or HCV-4 polynucleotide sequence which codes 
for an antigenic peptide. 

6- An HCV-3 or HCV-4 polynucleotide sequence which codes 
for C-100, 5-1-1, C33 or C22 peptides having sequence 
alterations unique to HCV-3 or HCV-4. 

7. An HCV-3 polynucleotide sequence encoding the 
polypeptide 

KPALVPDKEVLYQQYDEM or the polypeptide 

ECSQAAPYIEQAQVIAHQF 
or polypeptides of substantially equivalent antigenicity. 



wo 93/10239 



-90 - 



PCr/GB92/02143 



8. A hepatitus C virus type 2 (HCV-2) polynucleotide 
sequence encoding the polypeptide 

R(A/V)V(V/I) (A/T)PDKE(1/V)LYEAFDEM or the polypeptide 
EGAS (K/R) AALIEEGQR (M/I) AEML 
or polypeptides of substantially equivalent antigenicity. 

9. An antigenic HCV-3 or HCV-4 specific peptide. 

10. An antigenic HCV-3 or HCV-4 specific peptide from the 
core, NS3^ or NS5 regions. 

11. An antigenic HCV-3 specific peptide comprised within 
the sequences shown in Figures 3, 5 or 7. 

12. An antigenic HCV-3 or HCV-4 specific peptide vrtiich 
corresponds to c-lOO, 5-1-1, C33 or C22 proteins having 
sequence alterations unique to HCV-3 or HCV-4. 

13- An antigenic NS4 peptide specific to HCV-3 or HCV-4. 

14. An antigenic HCV-2 specific peptide which is 
R(A/V)V(V/I) (A/T)PDKE(1/V)LYEAFDEM or 

EGAS (K/R) AALIEEGQR(M/I) AEML 

15. An antigenic HCV-3 specific peptide which is 
KPALVPDKEVLYQQYDEM or 
ECSQAAPYIEQAQVIAHQF 
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16, An antigenic NS4 peptide specific to HCV-4 which 
comprises the HCV-4 sequence substantially from position 
1691 to 1708 or the sequence substantially from position 
1710 to 1728. 

17. A fusion peptide which comprises at least two of the 
antigens of claims 9 to 16. 

18* A fusion peptide according to claim 17 which 
comprises at least one of the antigens of claims 9 to 16, 
fused to 6-galactosidase, GST, trp E, or polyhedrin coding 
sequence . 

19. A peptide according to any of claims 9 to 18 which is 
labelled. 

20. A vaccine fornmlation which comprises an antigen 
according to any of claims 9-18. 

21. Antibody to the antigenic peptide of any of claims 9 
to 19. 

22. An immunoassay device which comprises a solid 
substrate having attached thereto antigen according to any 
of claims 9 to 19. 

23. A device according to claim 22 for HCV screening 
which comprises a mixture of antigens. 
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24. A device according to claim 23 wherein the antigens 
are from the core and NS4 regions. 

25. A device according to claim 22 for HCV-typing which 
comprises a plate having a series of locations 
respectively containing HCV-1, HCV-2, and HCV-3 specific 
antigens. 

26. A device according to claim 25 which further 
con^rises a location containing HCV-4 specific antigen. 

27. An immunoassay device which comprises a solid 
substrate having attached thereto antibody according to 
claim 21. 

28. A method of in vitro testing for HCV which comprises 
reverse transcribing any HCV polynucleotide present and 
amplifying by polymerase chain reaction (PGR) , and 
detecting the amplified HCV polynucleotide employing an 
HCV-2, HCV-3 or HCV-4 specific polynucleotide probe. 

29. A method of in vitro testing for HCV according to 
claim 28 wherein the primers employed for PGR are selected 
from those set out in Table 1. 
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30. A method of in vitro HCV typing which comprises 

- carrying out endonuclease digestion of an HCV- 
containing sample employing ScrFI or Haelll/Rsal 
endonuclease; and 

- comparing the restriction patterns with 
chcoracteristic type-specific patterns. 

31. A method according to claim 30 or 31 which further 
employs Hinfl in a separate or combined digestion. 

32. A method of in vitro HCV typing which comprises 

- carrying out endonuclease digestion of an 
HCV-containing Seunple employing ScrFI endonuclease, 
the restriction pattern being characteristic of 
HCV-1, HCV-2 and HCV-3; 

- carrying out endonuclease digestion employing Hinfl 
endonuclease, the restriction pattern being 
characteristic of HCV-4. 

33. A method of in vitro screening a sample for HCV 
antibodies which comprises; 

- carrying out an immunoassay employing an antigenic 
peptide according to any of claims 1 to 19 or mixture 
thereof; and 

- detecting any antibody - antigen complex produced. 
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34. A method of in vitxo HCV antibody typing which 
comprises separately contacting a potentially 
antibody-containing san^le or samples with a series of 
HCT-type-specific antigens; and detecting any 
antibody-antigen complex produced with each type-specific 
antigen. 
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FIGURE 2 
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FIGURE 4 
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FIGURE 6 
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FIGURE 8 
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FIGURE 9a 



NUCLEOTIDE 



SEQUESCES 



4911 



4970 



T0040 GACACACCCD GUCACAAAAU ACACCADGGC AOGCADGtTCA GCUGAUCUGG AASOAACCAC 
T0038 ??CACACCCC GOCACAAAAU ACAUCADGGC AUGCAncaCA GCDG AUC nGG AAGDAACCAC 
T0036 ??CACACCCC AETCACAAAAn ACACCADGGC ADGCADGUCA GCCGAUCtTGG AAGtXAACCAC 
T0026 ????CACCCC ADCGCGAAAtJ ACCDCADGGC AUGUADCnCA GCDGACTCUGG AAGOAACCAC 
T1787 ????CACCCC AUCACAAAAU AOSOCAUGGC ADGCADGUCA GCDGADCUGG AAGOAACCAC 



Cons GACACACCCa GDCACAAAAO ACAUCADGGC ADGCADGUCA GCDGADCUGG AAGDAACCAC 
4971 5030 

w » 

T0040 CAGCACCUGG GUGUDGCUDG GAGGGGUCCU CGCGGCCCUA GCGGCCUACU GCUDGDCAGU 
T003S CAGCACCUGG GUUUUUCUUG GAGGGGUCCU CG C G GC CCUA GCGGCCUACU GCUUGUCAGU 
T0036 CAGCACCUGG G DGUUG C UUG GAG G GG UC CU CGCGGCCCUA GCGGCCUACU GCUDGDCAGU 
T0026 CAGCACCUGG G UG U UCJCUUG GAGGAGUCCU CGCOGCCCUA GCGGCCUACU GCUUGUCAGU 
71787 CAGCACCUGG GUGUDGCUDG GAGGGGUCCU CGCGGCCCUA GCGGCCUACU GCDUGUCAGU 



Cons CAGCACCUGG GUGUUGCUUU GAGGGG UC C U CGCGGCCCUA GCGGCCUACU GCUDGDCAGU 

5031 5090 

T0040 CGGCDGCGUU GDGAUUGDGG GDCAUAUDGA GCUGGGGGGC AAGCCOGCA AUC6DUCCAGA 
T0038 C UG L UGCUJU GDGAUDGDGG GCCADAUUGA GCUGGGGGGC AAGCCCGCA CUCGUUCCAGA 
T0036 CUUCUUmUU GDGAUUGDGG GCCADAUUGA GCUGGGGGGC AAGCCGGCA C UC G UUCCA GA 
T0026 CUUCUUCUJU GDGAUUGDGG GUCAUAUUGA GCUGGGGGGC AAGCCAGCA CUCGUUCCAGA 
T1787 C O GCUGCGUU GUGADUGUGG GUCAUAUUGA GCUGGGAGGC AAGCCGGCA CUCGUUCCAGA 



Cons CGGCDGCGUU GUGADUGUGG GUCAUAUUGA GCUGGGGGGC AAGCCGGCA ADCGUUCCAGA 

5091 5150 

T0D40 CAAAGAG6DG UUGOAUCAAC AADACGAUGA GADGGAGGAG UGCUCGCAA 6CUGCCCCADA 
TO 03 8 CAAAGAAGUG DDGOADCAAC AADACGAUGA GADGGAGGAG UGCUCGCAA GCCGCCCCAUA 
T0036 CAAAGAGGDG UUGUADCAAC AADACGAUGA GADGGAGGAG UGCUCGCAA GCCGCCCCAUA 
T0026 CAAAGAGGDG UOGUAUCAAC AADACGAUGA GADGGAGGAG UGCUCGCAA GCCGCCCCAUA 
T1787 CAACGAGGDG UUGUA DCAAC AADACGAUGA GADGGAGGAG UGCUCGCAA GCCGCCCCAUA 



Cons CAAAGAGGUG UUGUA DCAAC AAUACGADGA GADGGAGGAG UGCUCGCAA GCUGCCCCADA 



T0038 DADCGAACAA GCDCAAGUAA UAGCCCACCA GDDCAAGGAG AAAGDCCUU GGADUGCDGCA 
T0036 DADCGAACAA GCUCAGGDAA UAGCCCACCA GUUCAAGGAG AAAGUCCUU GGADUGCDGCA 
T0026 DADCGAACAA GCDCAOGUAA UAGCCCACCA GDDCAAGGAG AAAGDCCUU OGADUGCUGCA 
T1787 UAUCGAACAA GCUCAGGDAA UAGCCCACCA GDDCAAGGAG AAAGUCCUU OGCUUGCDGCA 



Cons DADCGAACAA GCDCAGGDGA DAGCCCACCA GDDCAAGGAG AAA GUCCUU 06ADDGCUGCA 

5211 5270 

T0040 GCGAGCCACC CAACAACAAG CDCDDADDGA GCCAADAGDA GCDACCAAC UGGCAAAAGCU 
T0038 GCGGGCCACC CAACAACAAG CUGUDADDGA GCCCADAGUA GCDACCAAC UGGCAAAAGCU 
T0036 GCGAGCCACC CAACAACAAG CUGDUAUUGA GCCAADAGDA GCDACCAAC DGGCAAAA7?? 
T0026 GCGAGCCACC CAACAACAAG CDGUCAUUGA GCCCADAGUA GCDACCAAC CGGCAAAA??? 
T1787 GCGAGCCACC CAACAACAGG CUGUCAUUGA GCCCADAGUA GCUACCAAC UGGCAAAAGCU 



Cons GCGAGCCACC CAACAACAAG CU G DUAUU G A GCCCAUAGUA GCUACCAAC UGGCAAAAGCU 
5271 

9 

T0040 UGA GAC C 
T0038 UGAGGCC 
T0036 77????? 
T0026 ??????? 
T1787 UGAGGCU 



Cons UGAGACC 



5151 5210 
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FIGUBE 9b 

'^g Di; ' _ * B P JUCZSO ACID SBQUBBCBS 

1638 1787 

T0040 THPVT KYIMA CMSAD LEVTT STWVL LGGVL AALAA YCLSV GCWI VGHIE 

T0038 HPVT KYIMA CMSAD LEVTT STWVL LGGVL AALAA YCLSV GCWI VGHIE 

T0036 'hPIT KYIMA CMSAD LEVTT STWVL LGGVL AALAA YCLSV GCWI VGHIE 

T0026 HPIA KYLMA CMSAD LEVTT STWVL LGGVL AALAA YCLSV GCWI VGHIE 

T1787 iHPrP KYTMA CMSAD LEVTT STWVL LGGVL AALAA YCLSV GCWI VGHIE 

Cons THPVT KYIMA CMSAD LEVTT SIWVL LGGVL AALAA YCLSV GCWI VGHIE 

1688 1-^37 

T0040 LGGKP AIVPD KEVLY QQYDE MEECS QAAPY lEQAQ VIAHQ FKEKV LGLLQ 
T0038 LGGKP ALVPD KEVLY QQYDE MEECS QAAPY lEQAQ VIAHQ FKEKV LGLLQ 
T0036 LGGKP ALVPD KEVLY QQYDE MEECS QAAPY lEQAQ VIAHQ FKEKV LGLLQ 
T0026 LGGKP ALVPD KEVLY QQYDE MEECS QAAPY lEQAQ VIAHQ FKEKV LGLLQ 
T1787 LGGKP ALVPD KEVLY QQYDE MEECS QAAPY lEQAQ VIAHQ FKEKV LGLLQ 

Cons LGGKP ALVPD KEVLY QQYDE MEECS QAAPY lEQAQ VIAHQ FKEKV LGLLQ 
1738 

T0040 RATQQ QAVIE PIVAT NWQKL ETFWH KHM 
T0038 RATQQ QAVIE PIVAT NWQKL EAFWH KHM 

T0036 RATQQ QAVIE PIVAT NWQ 

T0026 RATQQ QAVIE PIVAT NWQ 

T1787 RATQQ QAVIE PIVAT NWQKL EAFWH KHM 

cans RATQQ QAVIE PIVAT NWQKL EAFWH KHM 
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